INFERENCE FOR THE LIMITING CLUSTER SIZE
DISTRIBUTION OF EXTREME VALUES

By Christian Y. Robert 

CNAM and CREST, France 

Any limiting point process for the time normalized exceedances of 
high levels by a stationary sequence is necessarily compound Poisson 
under appropriate long range dependence conditions. Typically ex- 
ceedances appear in clusters. The underlying Poisson points represent 
the cluster positions and the multiplicities correspond to the cluster 
sizes. In the present paper we introduce estimators of the limiting 
cluster size probabilities, which are constructed through a recursive 
algorithm. We derive estimators of the extremal index which plays a 
key role in determining the intensity of cluster positions. We study 
the asymptotic properties of the estimators and investigate their fi- 
nite sample behavior on simulated data. 

1. Introduction. Many results in extreme value theory may be naturally
discussed in terms of point processes. Typically, the distribution of
extreme order statistics may be obtained by considering the point process
of exceedances of a high level. More formally, let $(X_n)$ be a strictly
stationary sequence of random variables (r.v.s) with marginal distribution $F$.
We assume that for each $\tau > 0$ there exists a sequence of levels
$(u_n(\tau))$ such that $\lim_{n\to\infty} n\bar F(u_n(\tau)) = \tau$, where
$\bar F = 1 - F$. It is necessary and sufficient for the existence of such a
sequence that $\lim_{x\to x_f} \bar F(x)/\bar F(x-) = 1$, where
$x_f = \sup\{u : F(u) < 1\}$ (see Theorem 1.7.13 in [28]). A natural choice
is given by $u_n(\tau) = F^{\leftarrow}(1 - \tau/n)$, where $F^{\leftarrow}$ is
the generalized inverse of $F$, that is,
$F^{\leftarrow}(y) = \inf\{x \in \mathbb{R} : F(x) \ge y\}$. The point process
of time normalized exceedances $N_n^{(\tau)}(\cdot)$ is defined by
$N_n^{(\tau)}(B) = \sum_{i=1}^{n} 1_{\{i/n \in B,\, X_i > u_n(\tau)\}}$ for
any Borel set $B \subset E := (0, 1]$. The event that $X_{n-k+1:n}$, the $k$th
largest of $X_1, \ldots, X_n$, does not exceed $u_n(\tau)$ is equivalent to
$\{N_n^{(\tau)}(E) < k\}$ and the asymptotic distribution of $X_{n-k+1:n}$ is
easily derived from the asymptotic distribution of $N_n^{(\tau)}(E)$.

If $(X_n)$ is a sequence of independent and identically distributed (i.i.d.)
r.v.s, $N_n^{(\tau)}$ converges in distribution to a homogeneous Poisson
process with intensity $\tau$ (see, e.g., [13], Theorem 5.3.2). If the i.i.d.
assumption is relaxed and a long range dependence condition is assumed
[$\Delta(u_n(\tau))$ defined below], the limiting point process is necessarily
a homogeneous compound Poisson process with intensity $\theta\tau$
($\theta > 0$) and limiting cluster size distribution $\pi$ [24]. The constant
$\theta$ is referred to as the extremal index and its reciprocal is equal to
the mean of $\pi$ under some mild additional assumptions (see [36, 38] for
some counterexamples). It may be shown that $\theta \le 1$ and that the
compound Poisson limit becomes Poisson when $\theta = 1$.

If $\lim_{n\to\infty} P(N_n^{(\tau)}(E) = 0) = e^{-\theta\tau}$, then a
necessary and sufficient condition for convergence of $N_n^{(\tau)}$ is
convergence of the conditional distribution of $N_n^{(\tau)}(B_n)$ with
$B_n = (0, q_n/n]$, given that there is at least one exceedance of
$u_n(\tau)$ in $\{1, \ldots, q_n\}$, to $\pi$, that is,

(1.1)  $\lim_{n\to\infty} P(N_n^{(\tau)}(B_n) = m \mid N_n^{(\tau)}(B_n) > 0) = \pi(m)$,  $m \ge 1$,

where $(q_n)$ is a $\Delta(u_n(\tau))$-separating sequence (see Section 3).
Moreover, if the long range dependence condition $\Delta(u_n(\tau))$ holds for
each $\tau > 0$, then $\theta$ and $\pi$ do not depend on $\tau$.

The natural approach to inference on $\theta$ and $\pi$ is to identify the
clusters of exceedances above a high threshold, then to evaluate for each
cluster the characteristic of interest and to construct estimates from these
values. The two common methods that are used to define clusters are the blocks
and runs declustering schemes. The blocks declustering scheme consists in
choosing a block length $r_n$ and partitioning the $n$ observations into
$k_n = \lfloor n/r_n \rfloor$ blocks, where $\lfloor x \rfloor$ denotes the
integer part of $x$. Each block that contains an exceedance is treated as one
cluster. The runs declustering scheme consists in choosing a run length $p_n$
and stipulating that any pair of extreme observations separated by fewer than
$p_n$ nonextreme observations belong to the same cluster. The block length
$r_n$ and the run length $p_n$ are termed the cluster identification scheme
sequences and play a key role in determining the asymptotic properties of the
estimators.
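The two declustering schemes can be illustrated with a short sketch. The function names `blocks_clusters` and `runs_clusters` are hypothetical, NumPy is assumed, and the threshold is taken as given rather than estimated; this is only meant to make the two cluster definitions concrete.

```python
import numpy as np

def blocks_clusters(x, threshold, block_length):
    """Blocks declustering: cluster sizes are the exceedance counts of the
    blocks that contain at least one exceedance."""
    x = np.asarray(x, dtype=float)
    k = len(x) // block_length                  # k_n = floor(n / r_n); leftover tail is ignored
    exceed = (x[:k * block_length] > threshold).reshape(k, block_length)
    counts = exceed.sum(axis=1)
    return counts[counts > 0]

def runs_clusters(x, threshold, run_length):
    """Runs declustering: exceedances separated by fewer than run_length
    non-exceedances are assigned to the same cluster."""
    idx = np.flatnonzero(np.asarray(x, dtype=float) > threshold)
    if idx.size == 0:
        return np.array([], dtype=int)
    gaps = np.diff(idx) - 1                     # non-exceedances between consecutive exceedances
    breaks = np.flatnonzero(gaps >= run_length) # a long enough gap starts a new cluster
    return np.diff(np.concatenate(([0], breaks + 1, [idx.size])))
```

For the same data and threshold the two schemes generally return different cluster sizes, which is why the choice of $r_n$ or $p_n$ matters for the resulting estimates.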

The problem of inference on the extremal index has received a lot of
attention in the literature. The first blocks and runs estimators were
constructed by using different probabilistic characterizations of the extremal
index (see [13], Section 8.1, [1, 39]). They are determined by two sequences:
the sequence of the thresholds $u_n(\tau)$ and the cluster identification
scheme sequence. Their major drawback is their dependence on the threshold
which is based on the unknown stationary distribution. Estimating this
threshold is intricate since, by definition, it is exceeded by very few
observations [12]. To circumvent this issue, lower thresholds have to be
considered. The following characterizations (see [27, 31])

$\theta = \lim_{n\to\infty} s_n P\left(\max_{1\le i\le r_n} X_i > u_{s_n}(\tau)\right) / (r_n\tau)$

and

$\theta = \lim_{n\to\infty} P\left(\max_{2\le i\le p_n} X_i \le u_{s_n}(\tau) \mid X_1 > u_{s_n}(\tau)\right)$,

where $s_n = o(n)$, $r_n = o(s_n)$ and $p_n = o(s_n)$, have motivated other
blocks and runs estimators [21, 22, 43]; the threshold $u_{s_n}(\tau)$ can be
estimated by $X_{n-\lfloor n\tau/s_n\rfloor:n}$. Note that the estimators are
determined by two sequences as well: $r_n$ (or $p_n$) and $s_n$. More
recently, new methods for identifying clusters of extreme values have been
introduced in [26] and new estimators of the extremal index which are less
sensitive to cluster identification scheme sequences have been derived.
However, to exploit these methods, it is necessary to know whether the process
exhibits either an autoregressive or volatility driven dependence structure
and to choose an additional threshold to identify the clusters. In order to
eliminate the cluster identification scheme sequences, [16] (see also [15])
proposes estimators which are based on the sequence of the thresholds
$u_{r_n}(\tau)$ and on inter-exceedance times: a least-squares estimator, a
maximum-likelihood estimator and a moment estimator. It is established that
the last-mentioned estimator is weakly consistent for m-dependent stationary
sequences.

There are very few papers which investigate the inference for the limiting
cluster size probabilities. In [21], condition (1.1) is used to motivate the
following blocks estimators

(1.2)  $\hat\pi_{n,1}(m; r_n, u_{s_n}(\tau)) = \dfrac{\sum_{j=1}^{k_n} 1_{\{N_{n,j}(u_{s_n}(\tau)) = m\}}}{\sum_{j=1}^{k_n} 1_{\{N_{n,j}(u_{s_n}(\tau)) > 0\}}}$,

where $N_{n,j}(u_{s_n}(\tau)) = \sum_{i=(j-1)r_n+1}^{jr_n} 1_{\{X_i > u_{s_n}(\tau)\}}$,
$s_n = o(n)$ and $r_n = o(s_n)$. Let $E_\pi(T) = \sum_{m=1}^{\infty} T(m)\pi(m)$,
where $T$ is a function supported on $\{1, 2, \ldots\}$. The weak consistency
of the estimators

$\sum_{m=1}^{r_n} T(m)\,\hat\pi_{n,1}(m; r_n, X_{n-\lfloor n\tau/s_n\rfloor:n})$

of $E_\pi(T)$ is established. Note that they are determined by two sequences:
$r_n$ and $s_n$. In [23] the following quantities are considered

*„,.(m;r„.«,.(r)) = Si^.l^, )1k,.<...<.,)>.,1 _ 

Z.j = ll{y„,,K„(r))>0} 
where R^' = F{Mf')/F{Mf>) and Mf' is the mth largest value of Xi,
$i = (j-1)r_n + 1, \ldots, jr_n$. A partial comparison with
$\hat\pi_{n,1}(m; r_n, u_{s_n}(\tau))$ is made under the assumption that $F$
is known. Recently a new method has been proposed in [15]: a recursive
algorithm forms estimates of the limiting cluster size probabilities from
empirical moments which are based on the joint distributions of the
inter-exceedance times separated by other inter-exceedance times. These
estimators are only determined by selecting the sequence of thresholds
$u_{r_n}(\tau)$. A consistency result for m-dependent stationary sequences is
given.

In the present paper we introduce new blocks estimators of the limiting 
cluster size probabilities. The approach is the following. First we estimate 
the compound probabilities of the limiting point process. Second we use a 
declustering (decompounding) algorithm to form estimates of the limiting 
cluster size probabilities. This idea has been proposed recently in [5] and [6] 
where it is assumed that a sample of the compound Poisson distribution is 
observed (which is unfortunately not the case here). 

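More specifically, let us denote by $N^{(\tau)}$ the weak limit of
$N_n^{(\tau)}(E)$ as $n \to \infty$ when it exists and by
$p^{(\tau)} = (p^{(\tau)}(m))_{m\ge 0}$ its distribution. Let
$(\zeta_i)_{i\ge 1}$ be a sequence of positive i.i.d. integer-valued r.v.s
with distribution $\pi$ and $\eta(\theta\tau)$ be a r.v. with Poisson
distribution and parameter $\theta\tau$ such that $\eta(\theta\tau)$ is
independent of the $(\zeta_i)_{i\ge 1}$. We have
$N^{(\tau)} \stackrel{d}{=} \sum_{i=1}^{\eta(\theta\tau)} \zeta_i$ with the
convention that the sum equals 0 if the upper index is smaller than the lower
index. The distribution of $N^{(\tau)}$ is given by

(1.3)  $p^{(\tau)}(0) = P(\eta(\theta\tau) = 0) = e^{-\theta\tau}$,

(1.4)  $p^{(\tau)}(m) = \sum_{j=1}^{m} P(\eta(\theta\tau) = j)\,P\left(\sum_{i=1}^{j}\zeta_i = m\right) = e^{-\theta\tau}\sum_{j=1}^{m}\dfrac{(\theta\tau)^j}{j!}\,\pi^{*j}(m)$,

$m \ge 1$, where $\pi^{*j}$ is the $j$th convolution of $\pi$, that is,

$\pi^{*j}(m) = \sum_{i_1+\cdots+i_j=m} \pi(i_1)\cdots\pi(i_j)$,  $m \ge j$.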
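As an illustration of the compound Poisson probabilities (1.3)-(1.4), the sketch below evaluates them by repeated convolution of the cluster size distribution. The function name `compound_poisson_pmf` and the array layout (`pi[k-1]` holds $\pi(k)$) are assumptions made for this example only.

```python
import numpy as np
from math import exp, factorial

def compound_poisson_pmf(pi, theta_tau, m_max):
    """Return p(0), ..., p(m_max) of a Poisson(theta_tau) sum of i.i.d.
    cluster sizes with distribution pi, where pi[k-1] = pi(k)."""
    pi_full = np.zeros(m_max + 1)               # pi_full[k] = pi(k), pi_full[0] = 0
    k = min(len(pi), m_max)
    pi_full[1:k + 1] = pi[:k]
    p = np.zeros(m_max + 1)
    p[0] = exp(-theta_tau)                      # (1.3)
    conv = np.zeros(m_max + 1)
    conv[0] = 1.0                               # 0-fold convolution: point mass at 0
    for j in range(1, m_max + 1):
        conv = np.convolve(conv, pi_full)[:m_max + 1]   # j-fold convolution of pi
        p[1:] += exp(-theta_tau) * theta_tau**j / factorial(j) * conv[1:]   # (1.4)
    return p
```

When every cluster has size one (`pi = [1.0]`), the result reduces to the ordinary Poisson probabilities, which matches the remark that the limit becomes Poisson when $\theta = 1$.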

In risk theory the aggregate claim amount is often assumed to have a
compound Poisson distribution. Panjer's algorithm [32] is a method to compute
recursively the aggregate claims distribution when the distribution of a
single claim is discrete and the distribution of the number of claims is
Poisson, Binomial or Negative-Binomial. For the limiting compound Poisson
distribution (1.3)-(1.4), the recursion is given by

$p^{(\tau)}(0) = e^{-\theta\tau}$,

$p^{(\tau)}(m) = -\dfrac{\ln(p^{(\tau)}(0))}{m}\sum_{j=1}^{m} j\,\pi(j)\,p^{(\tau)}(m-j)$,  $m \ge 1$.

Note that the $p^{(\tau)}(m)$ can be expressed as a function of the $\pi(j)$,
$j = 1, \ldots, m$. It is possible to reverse the algorithm and to evaluate
recursively the $\pi(m)$ from the $p^{(\tau)}(j)$, $j = 0, \ldots, m$, and the
$\pi(j)$, $j = 1, \ldots, m-1$, in the following way

(1.5)  $\pi(m) = -\dfrac{p^{(\tau)}(m) + m^{-1}\ln(p^{(\tau)}(0))\sum_{j=1}^{m-1} j\,\pi(j)\,p^{(\tau)}(m-j)}{\ln(p^{(\tau)}(0))\,p^{(\tau)}(0)}$,  $m \ge 1$.

Hence, the inversion of Panjer's algorithm provides an appealing recursive
method to estimate the limiting cluster size probabilities.
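The forward recursion and its inversion can be checked numerically with a minimal sketch. The names `panjer_forward` and `panjer_inverse` are hypothetical, and the inputs are exact probabilities rather than estimates, so no truncation is applied here.

```python
import numpy as np

def panjer_forward(pi, theta_tau, m_max):
    """Compound Poisson probabilities p(0..m_max) from cluster size
    probabilities pi (pi[k-1] = pi(k)) by Panjer's recursion."""
    pi_full = np.zeros(m_max + 1)
    k = min(len(pi), m_max)
    pi_full[1:k + 1] = pi[:k]
    p = np.zeros(m_max + 1)
    p[0] = np.exp(-theta_tau)
    for m in range(1, m_max + 1):
        j = np.arange(1, m + 1)
        p[m] = theta_tau / m * np.sum(j * pi_full[j] * p[m - j])
    return p

def panjer_inverse(p):
    """Recover pi(1..m_max) from p(0..m_max) by reversing the recursion;
    note that theta_tau is not needed since it equals -log(p[0])."""
    m_max = len(p) - 1
    log_p0 = np.log(p[0])
    pi = np.zeros(m_max + 1)
    for m in range(1, m_max + 1):
        j = np.arange(1, m)                     # empty when m = 1
        s = np.sum(j * pi[j] * p[m - j])
        pi[m] = -(p[m] + log_p0 / m * s) / (log_p0 * p[0])
    return pi[1:]
```

Applying `panjer_inverse` to the output of `panjer_forward` returns the original cluster size probabilities up to rounding error, which is the property the estimation method relies on.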

The content of the paper is organized as follows. In Section 2 we explain 
how we construct the estimators of the limiting cluster size probabilities. 
We also derive estimators of the extremal index. We emphasize that all our 
estimators are determined by one sequence and one (or two) parameter(s). In
Section 3 we present and discuss technical conditions which are required for 
establishing the asymptotic properties. In Section 4 we give results on weak 
convergence of the estimators. In Section 5 we investigate the finite sample 
behavior of the estimators on simulated data and we make a comparison 
with existing estimators. Proofs are gathered in a last section. 

2. Defining the estimators. In the remainder of the paper we assume
that $u_n(\tau) = F^{\leftarrow}(1 - \tau/n)$. The present approach to
estimating the limiting cluster size distribution is based on the blocks
declustering scheme. We divide $\{1, \ldots, n\}$ into $k_n$ blocks of length
$r_n$, $I_j = \{(j-1)r_n + 1, \ldots, jr_n\}$ for $j = 1, \ldots, k_n$, and a
last block $I_{k_n+1} = \{r_nk_n + 1, \ldots, n\}$. The number of observations
above the threshold $u_{r_n}(\tau)$ within the $j$th block is denoted by

$N_{r_n,j}^{(\tau)} = \sum_{i\in I_j} 1_{\{X_i > u_{r_n}(\tau)\}}$,  $j = 1, \ldots, k_n$.

Since $\lim_{n\to\infty} E(N_{r_n,j}^{(\tau)}) = \tau$, the parameter $\tau$
can be interpreted as the asymptotic mean number of observations which exceed
the level $u_{r_n}(\tau)$ for each block. The empirical distribution,
$\bar p_n^{(\tau)}$, of the number of exceedances within a block is given by

$\bar p_n^{(\tau)}(m) = \dfrac{1}{k_n}\sum_{j=1}^{k_n} 1_{\{N_{r_n,j}^{(\tau)} = m\}}$,  $m \ge 0$.

As mentioned in the introduction, the main issue when using these quantities
for estimating $p^{(\tau)}$ is that the threshold $u_{r_n}(\tau)$ is based on
the unknown stationary distribution. It has to be estimated from the data. We
define the estimator of $p^{(\tau)}$ by

$\hat p_n^{(\tau)}(m) = \dfrac{1}{k_n}\sum_{j=1}^{k_n} 1_{\{\hat N_{r_n,j}^{(\tau)} = m\}}$,  $m \ge 0$,

where $\hat N_{r_n,j}^{(\tau)} = \sum_{i\in I_j} 1_{\{X_i > \hat u_{r_n}(\tau)\}}$
and $\hat u_{r_n}(\tau) = X_{k_nr_n-\lfloor k_n\tau\rfloor : k_nr_n}$.
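A minimal sketch of this step, assuming NumPy and a hypothetical function name `block_count_distribution`: the threshold is taken as the order statistic that leaves $\lfloor k_n\tau\rfloor$ observations above it among the first $k_nr_n$ observations, and the block exceedance counts are then tabulated. Counts larger than `m_max` are simply dropped from the returned vector, which is a simplification of this example.

```python
import numpy as np

def block_count_distribution(x, r_n, tau, m_max):
    """Empirical distribution of the number of exceedances per block,
    with the threshold estimated by an order statistic of the sample."""
    x = np.asarray(x, dtype=float)
    k_n = len(x) // r_n
    x = x[:k_n * r_n]                           # the last incomplete block is not used
    n_exc = int(np.floor(k_n * tau))            # target total number of exceedances
    # order statistic number (k_n r_n - floor(k_n tau)) in ascending order (1-based)
    u_hat = np.sort(x)[k_n * r_n - n_exc - 1]
    counts = (x > u_hat).reshape(k_n, r_n).sum(axis=1)
    p_hat = np.bincount(counts, minlength=m_max + 1)[:m_max + 1] / k_n
    return p_hat, u_hat
```

With distinct observations exactly $\lfloor k_n\tau\rfloor$ values exceed the estimated threshold, so the mean number of exceedances per block is close to $\tau$ by construction.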

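Let us now consider the estimators of the limiting cluster size
probabilities. To ensure that the entries in (1.5) are nonnegative and that
their sum does not exceed 1, we define recursively

$\hat\pi_n^{(\tau)}(m) = \max\left(0, \min\left(\tilde\pi_n^{(\tau)}(m),\, 1 - \sum_{j=1}^{m-1}\hat\pi_n^{(\tau)}(j)\right)\right)$,  $m \ge 1$,

where

$\tilde\pi_n^{(\tau)}(m) = -\dfrac{\hat p_n^{(\tau)}(m) + m^{-1}\ln(\hat p_n^{(\tau)}(0))\sum_{j=1}^{m-1} j\,\hat\pi_n^{(\tau)}(j)\,\hat p_n^{(\tau)}(m-j)}{\ln(\hat p_n^{(\tau)}(0))\,\hat p_n^{(\tau)}(0)}$.

We also define smoothed versions by

$\bar\pi_n(m) = \dfrac{1}{\phi-\sigma}\int_\sigma^\phi \hat\pi_n^{(\tau)}(m)\,d\tau$,  $m \ge 1$,

for given $0 < \sigma < \phi$ (see [35] for a similar averaging technique used
to reduce the asymptotic variance of the moment estimator of the extreme value
parameter).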
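The truncated recursion and the averaging over $\tau$ might be sketched as follows. The names `cluster_size_estimates` and `smoothed_estimates` are hypothetical, the input `p_hat` stands for estimated block count probabilities with `0 < p_hat[0] < 1`, and the integral over $[\sigma, \phi]$ is replaced by a plain average over an equally spaced grid, which is an approximation chosen for this example.

```python
import numpy as np

def cluster_size_estimates(p_hat):
    """Cluster size estimates for m = 1..m_max from p_hat(0..m_max):
    inverse Panjer step, then truncation to [0, 1 - running sum]."""
    p_hat = np.asarray(p_hat, dtype=float)
    m_max = len(p_hat) - 1
    log_p0 = np.log(p_hat[0])                   # requires 0 < p_hat[0] < 1
    pi_hat = np.zeros(m_max + 1)
    for m in range(1, m_max + 1):
        j = np.arange(1, m)
        s = np.sum(j * pi_hat[j] * p_hat[m - j])
        raw = -(p_hat[m] + log_p0 / m * s) / (log_p0 * p_hat[0])
        pi_hat[m] = max(0.0, min(raw, 1.0 - pi_hat[1:m].sum()))
    return pi_hat[1:]

def smoothed_estimates(p_hat_of_tau, sigma, phi, grid_size=25):
    """Average the estimates over a grid of tau values in [sigma, phi];
    p_hat_of_tau is a callable returning p_hat(0..m_max) for a given tau."""
    taus = np.linspace(sigma, phi, grid_size)
    values = np.array([cluster_size_estimates(p_hat_of_tau(t)) for t in taus])
    return values.mean(axis=0)
```

For instance, `cluster_size_estimates([0.5, 0.5, 0.0])` gives an untruncated first value above 1 and a negative second value, so the truncation returns `[1.0, 0.0]`.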

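Finally, let us derive estimators of the extremal index. This parameter
appears in different moments of the distributions of $N^{(\tau)}$ and
$\zeta_1$ (when they exist)

$P(N^{(\tau)} = 0) = e^{-\theta\tau}$,  $E(\zeta_1) = \theta^{-1}$,  $V(N^{(\tau)}) = \theta\tau E(\zeta_1^2)$.

Fix an integer $m \ge 1$. We consider two approximations of $\theta$

$\theta_2(m) = \left(\sum_{j=1}^{m} j\,\pi(j)\right)^{-1}$,  $\theta_3^{(\tau)}(m) = \dfrac{\sum_{j=1}^{m} j^2 p^{(\tau)}(j) - \tau^2}{\tau\sum_{j=1}^{m} j^2\pi(j)}$.

Estimators of $\theta$, $\hat\theta_{1,n}^{(\tau)}$, $\hat\theta_{2,n}^{(\tau)}(m)$
and $\hat\theta_{3,n}^{(\tau)}(m)$, can be constructed by equating theoretical
moments to their empirical counterparts

$\hat\theta_{1,n}^{(\tau)} = -\dfrac{\ln(\hat p_n^{(\tau)}(0))}{\tau}$,  $\hat\theta_{2,n}^{(\tau)}(m) = \left(\sum_{j=1}^{m} j\,\hat\pi_n^{(\tau)}(j)\right)^{-1}$,  $\hat\theta_{3,n}^{(\tau)}(m) = \dfrac{\sum_{j=1}^{m} j^2\hat p_n^{(\tau)}(j) - \tau^2}{\tau\sum_{j=1}^{m} j^2\hat\pi_n^{(\tau)}(j)}$.

$\hat\theta_{1,n}^{(\tau)}$ can be seen as a slight modification of the
estimator in equation (1.5) in [39]. $\hat\theta_{2,n}^{(\tau)}(m)$ has been
studied in [21] with (1.2) as an estimator of the limiting cluster size
distribution and $m = r_n$. To the best of our knowledge,
$\hat\theta_{3,n}^{(\tau)}(m)$ seems to be new. Finally, let us define by
$\bar\theta_{1,n}$ the smoothed version of the first estimator

$\bar\theta_{1,n} = \dfrac{1}{\phi-\sigma}\int_\sigma^\phi \hat\theta_{1,n}^{(\tau)}\,d\tau$.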
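The first two moment identities translate directly into plug-in estimates. The sketch below is an illustration under that reading only; the function names are hypothetical, `p0_hat` stands for an estimate of the probability that a block contains no exceedance, and `pi_hat` for estimated cluster size probabilities (`pi_hat[k-1]` for size `k`).

```python
import numpy as np

def theta_from_zero_count(p0_hat, tau):
    """Extremal index from P(N = 0) = exp(-theta * tau)."""
    return -np.log(p0_hat) / tau

def theta_from_mean_cluster_size(pi_hat):
    """Extremal index as the reciprocal of the (truncated) mean cluster size."""
    pi_hat = np.asarray(pi_hat, dtype=float)
    sizes = np.arange(1, len(pi_hat) + 1)
    return 1.0 / np.sum(sizes * pi_hat)
```

The first function only uses the proportion of blocks without exceedances, while the second depends on how many cluster size probabilities are included in the sum.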

All estimators (resp. smoothed versions of the estimators) introduced in
this section are determined by the sequence $r_n$ and the parameter $\tau$
(resp. $\phi$ and $\sigma$). They provide an interesting alternative to the
estimators introduced in [16] and [15] where it is only needed to select the
sequence of the thresholds $u_{r_n}(\tau)$. Note that both methods share the
same parsimony since in our case $u_{r_n}(\tau)$ is estimated by
$X_{k_nr_n-\lfloor k_n\tau\rfloor : k_nr_n}$.

3. Technical conditions. In this section we present and discuss technical
conditions which are required for establishing the asymptotic properties of
the estimators. We begin by giving definitions which are essentially due to
[20, 27, 33].

The stationary sequence $(X_n)$ is said to have extremal index $\theta > 0$
if, for each $\tau > 0$, $\lim_{n\to\infty} P(N_n^{(\tau)}(E) = 0) = \exp(-\theta\tau)$.

Fix an integer $r \ge 1$ and $\tau_1 > \cdots > \tau_r > 0$. Define
$\mathcal{F}_{p,q}^{\tau_1,\ldots,\tau_r}$ as the $\sigma$-algebra generated
by the events $\{X_i > u_n(\tau_j)\}$, $p \le i \le q$ and $1 \le j \le r$,
and write

$\alpha_{n,l}(\tau_1, \ldots, \tau_r) = \sup\{|P(A\cap B) - P(A)P(B)| : A \in \mathcal{F}_{1,t}^{\tau_1,\ldots,\tau_r},\ B \in \mathcal{F}_{t+l,n}^{\tau_1,\ldots,\tau_r},\ 1 \le t \le n-l\}$.

The condition $\Delta(\{u_n(\tau_j)\}_{1\le j\le r})$ is said to hold if
$\lim_{n\to\infty}\alpha_{n,l_n}(\tau_1, \ldots, \tau_r) = 0$ for some
sequence $l_n = o(n)$. The long range dependence condition
$\Delta(\{u_n(\tau_j)\}_{1\le j\le r})$ implies that extreme events situated
far apart are almost independent. Of course, it is implied by strong mixing.

Suppose that $\Delta(\{u_n(\tau_j)\}_{1\le j\le r})$ holds. A sequence of
positive integers $(q_n)$ is said to be
$\Delta(\{u_n(\tau_j)\}_{1\le j\le r})$-separating if $q_n = o(n)$ and there
exists a sequence $(l_n)$ such that $l_n = o(q_n)$ and
$\lim_{n\to\infty} nq_n^{-1}\alpha_{n,l_n}(\tau_1, \ldots, \tau_r) = 0$.

We now present the technical conditions. The first one will be considered
for "weak consistency" of the estimators.

Condition (C0). The stationary sequence $(X_n)$ has extremal index
$\theta > 0$. $\Delta(u_n(\tau))$ holds for each $\tau > 0$ and there exists a
probability measure $\pi = (\pi(i))_{i\ge 1}$, such that, for $i \ge 1$,

(C0.a)  $\pi(i) = \lim_{n\to\infty} P(N_n^{(\tau)}(B_n) = i \mid N_n^{(\tau)}(B_n) > 0)$,

with $B_n = (0, q_n/n]$, for some $\Delta(u_n(\tau))$-separating sequence
$(q_n)$. Moreover, there exists a constant $\rho > 2$ such that, for each
$\tau > 0$,

(C0.b)  $\sup_{n\ge 1} E(N_n^{(\tau)}(E))^{\rho} < \infty$.

Condition (C0) ensures that the exceedance point process $N_n^{(\tau)}$
converges in distribution for every choice of $\tau > 0$ (see [24], Theorem
4.2). Let $0 < \nu < \rho$. Condition (C0.b) implies that
$(N_n^{(\tau)}(E))^{\nu}$ are uniformly integrable and
$\lim_{n\to\infty} E(N_n^{(\tau)}(E))^{\nu} = E(N^{(\tau)})^{\nu} < \infty$.
In particular, the first and second moments of $N^{(\tau)}$ exist (see [4],
page 338). They are given by $E(N^{(\tau)}) = \tau$ and
$V(N^{(\tau)}) = \theta\tau E(\zeta_1^2)$.

The following set of conditions will be considered for characterizing the 
distributional asymptotics of the estimators. 



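Condition (C1). Condition (C0) holds. $\Delta(u_n(\tau_1), u_n(\tau_2))$
holds for each $\tau_1 > \tau_2 > 0$ and there exists a probability measure
$\pi_2^{(\tau_1,\tau_2)} = (\pi_2^{(\tau_1,\tau_2)}(i,j))_{i\ge j\ge 0,\, i\ge 1}$,
such that, for $i \ge j \ge 0$, $i \ge 1$,

(C1.a)  $\pi_2^{(\tau_1,\tau_2)}(i,j) = \lim_{n\to\infty} P(N_n^{(\tau_1)}(B_n) = i,\ N_n^{(\tau_2)}(B_n) = j \mid N_n^{(\tau_1)}(B_n) > 0)$,

with $B_n = (0, q_n/n]$, for some $\Delta(u_n(\tau_1), u_n(\tau_2))$-separating
sequence $(q_n)$.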

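Let us introduce the two-level exceedance point process
$N_n^{(\tau_1,\tau_2)} = (N_n^{(\tau_1)}, N_n^{(\tau_2)})$ for
$\tau_1 > \tau_2 > 0$. Condition (C1) ensures that $N_n^{(\tau_1,\tau_2)}$
converges in distribution to a point process with Laplace transform

$E\exp\left(-\sum_{i=1}^{2}\int_E f_i\,dN^{(\tau_i)}\right) = \exp\left(-\theta\tau_1\int_E\bigl(1 - L(f_1(t), f_2(t))\bigr)\,dt\right)$,

where $N^{(\tau_i)}$ is the $i$th marginal of the limiting point process,
$f_i \ge 0$ and $L$ is the Laplace transform of $\pi_2^{(\tau_1,\tau_2)}$ (see
Theorem 2.5 in [33] and its proof).

Let us denote by $(N^{(\tau_1)}, N^{(\tau_2)})$ the weak limit of
$(N_n^{(\tau_1)}(E), N_n^{(\tau_2)}(E))$. By considering constant functions
$f_i$, we deduce that

$(N^{(\tau_1)}, N^{(\tau_2)}) \stackrel{d}{=} \left(\sum_{l=1}^{\eta(\theta\tau_1)}\zeta_{1,l}^{(\tau_1,\tau_2)},\ \sum_{l=1}^{\eta(\theta\tau_1)}\zeta_{2,l}^{(\tau_1,\tau_2)}\right)$,

where $(\zeta_{1,l}^{(\tau_1,\tau_2)}, \zeta_{2,l}^{(\tau_1,\tau_2)})_{l\ge 1}$
is a sequence of i.i.d. integer vector r.v.s with distribution
$\pi_2^{(\tau_1,\tau_2)}$ and $\eta(\theta\tau_1)$ is a r.v. with Poisson
distribution and parameter $\theta\tau_1$ such that $\eta(\theta\tau_1)$ is
independent of the $(\zeta_{1,l}^{(\tau_1,\tau_2)}, \zeta_{2,l}^{(\tau_1,\tau_2)})$
(see also Theorem 2 in [29]). The distribution
$p_2^{(\tau_1,\tau_2)} = (p_2^{(\tau_1,\tau_2)}(i,j))_{i\ge j\ge 0}$ of
$(N^{(\tau_1)}, N^{(\tau_2)})$ is given by

$p_2^{(\tau_1,\tau_2)}(0,0) = P(\eta(\theta\tau_1) = 0) = e^{-\theta\tau_1}$,

$p_2^{(\tau_1,\tau_2)}(i,j) = \sum_{k=1}^{i} P(\eta(\theta\tau_1) = k)\,P\left(\sum_{l=1}^{k}\zeta_{1,l}^{(\tau_1,\tau_2)} = i,\ \sum_{l=1}^{k}\zeta_{2,l}^{(\tau_1,\tau_2)} = j\right) = e^{-\theta\tau_1}\sum_{k=1}^{i}\dfrac{(\theta\tau_1)^k}{k!}\,\pi_2^{(\tau_1,\tau_2),*k}(i,j)$,

where $\pi_2^{(\tau_1,\tau_2),*k}$ is the $k$th convolution of
$\pi_2^{(\tau_1,\tau_2)}$, that is,

$\pi_2^{(\tau_1,\tau_2),*k}(i,j) = 0$ if $i < k$, and
$\pi_2^{(\tau_1,\tau_2),*k}(i,j) = \sum_{i_1+\cdots+i_k=i,\ j_1+\cdots+j_k=j}\pi_2^{(\tau_1,\tau_2)}(i_1,j_1)\cdots\pi_2^{(\tau_1,\tau_2)}(i_k,j_k)$ if $i \ge k$.

Condition (C0.b) implies that
$\mathrm{Cov}(N^{(\tau_1)}, N^{(\tau_2)}) = \theta\tau_1E(\zeta_{1,1}^{(\tau_1,\tau_2)}\zeta_{2,1}^{(\tau_1,\tau_2)})$
is finite.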

Condition (C2). Let $r > 2$ and $\phi > 0$. There exists a constant
$D = D(r, \phi)$ such that, for $\phi \ge \tau_1 > \tau_2 > 0$,

(C2.a)  $\sup_{n\ge 1} E(N_n^{(\tau_1)}(E) - N_n^{(\tau_2)}(E))^{r} \le D(\tau_1 - \tau_2)$.

Let $\theta_d > 3r/(r - (2+\mu))$, where $0 < \mu < ((r-2)\wedge 1/2)$. There
exists a constant $C > 0$ such that, for every choice of
$\tau_1 > \cdots > \tau_m > 0$, $m \ge 1$, $1 \le l \le n$,

(C2.b)  $\alpha_{n,l}(\tau_1, \ldots, \tau_m) \le \alpha_l := Cl^{-\theta_d}$.

$(r_n)$ is a sequence such that $r_n \to \infty$ and $r_n = o(n)$ and there
exists a sequence $(l_n)$ satisfying

(C2.c)  $l_n = o(r_n^{2/r})$ and $\lim_{n\to\infty} nr_n^{-1}\alpha_{l_n} = 0$.

Note that condition (C2.a) provides an inequality which is quite natural
to prove tightness criteria. Condition (C2.b) is satisfied by strong-mixing
stationary sequences where the mixing coefficients vanish at least with a
hyperbolic rate. The underlying idea to establish the asymptotic properties
of the estimators is to split the block $I_j$ into a small block of length
$l_n$ and a big block of length $r_n - l_n$. Condition (C2.c) ensures that
$l_n$ is sufficiently large such that blocks that are not adjacent are
asymptotically independent, but does not grow too fast such that the
contributions of the small blocks are negligible.

Finally, we need a condition on the convergence rate of r„ to infinity to 
guarantee that the extreme value approximations are sufficiently accurate. 

Condition (C3). Let $m$ be an integer. The sequence $(r_n)$ satisfies

$\lim_{n\to\infty}\sqrt{k_n}\,\bigl(\tau - r_n\bar F(u_{r_n}(\tau))\bigr) = 0$

and 

m 

locally uniformly for r > 0. 

Note that, if $F$ is continuous, then $r_n\bar F(u_{r_n}(\tau)) = \tau$ and
the first part of Condition (C3) is obviously satisfied. We now discuss the
example of the first order stochastic equations with random coefficients. A
special case is the squared ARCH(1) process introduced in [14]. This process
is probably one of the most prominent financial time series models of the last
two decades.

Example 3.1. Let $X_0$ be a r.v. and let $(A_n, B_n)$, $n \ge 1$, be i.i.d.
$(0,\infty)^2$-valued random vectors independent of $X_0$. Define $X_n$ by
means of the stochastic difference equation

(3.1)  $X_n = A_nX_{n-1} + B_n$,  $n \ge 1$.

For sake of simplicity, we assume that the distribution of $(A_1, B_1)$ is
absolutely continuous. Kesten [25] proved that there exists a r.v. $X$,
independent of $(A_1, B_1)$, such that $X \stackrel{d}{=} A_1X + B_1$. Assume
that $X_0$ has the same distribution as $X$, so that $(X_n)$ is a strictly
stationary sequence. According to Corollary 2.4.1 in [8], $(X_n)$ is also
strongly mixing and absolutely regular with geometric rates.

Further, suppose that there exist $\kappa > 0$ and $\xi > 0$ such that

$EA_1^{\kappa} = 1$,  $E(A_1^{\kappa}\max(\log(A_1), 0)) < \infty$,
$EA_1^{\kappa+\xi} < \infty$ and $EB_1^{\kappa+\xi} \in (0, \infty)$.

Under these moment assumptions, results of Goldie [17] show that there exist
$c > 0$ and $\rho > 0$ such that

(3.2)  $\bar F(x) = cx^{-\kappa}(1 + O(x^{-\rho}))$,  as $x \to \infty$.

We deduce that $u_n(\tau) = (cn/\tau)^{1/\kappa}(1 + O(n^{-\rho/\kappa}))$ as
$n \to \infty$. The one-level point process of exceedances was studied in [19]
and the multi-level point process of exceedances in [33].

Now we successively verify that our technical conditions hold. Let
$R(x) = \#\{j \ge 1 : \tilde X\prod_{i=1}^{j} A_i > x\}$, where
$P(\tilde X > x) = x^{-\kappa}$, $x \ge 1$, and define
$\theta_k = P(R(1) = k)$, $k \ge 0$. Using results in [19] and in [33], we see
that $\Delta(u_n(\tau))$ holds for each $\tau > 0$ and that
$\theta = \theta_0$ and $\pi(k) = (\theta_{k-1} - \theta_k)/\theta_0$,
$k \ge 1$, for any $\Delta(u_n(\tau))$-separating sequence $(q_n)$ such that
$q_n = n^{\varsigma}$ with $0 < \varsigma < 1$. Moreover, by Lemma 6.1 with
$\tau_1 = \tau$ and $\tau_2 = 0$, we deduce that
$E(N_n^{(\tau)}(E))^3 < \infty$ and that Condition (C0) holds with $\rho = 3$.

By [33], $\Delta(u_n(\tau_1), u_n(\tau_2))$ holds for each
$\tau_1 > \tau_2 > 0$ and



«4"'"'(i,j) = (p(fl(l) = = -l.-R((| 

•fi 



1/k 



1/k 



■J 



-p(K((f)'"')=i-i.fi(i)=i 

for any $\Delta(u_n(\tau_1), u_n(\tau_2))$-separating sequence $(q_n)$ such
that $q_n = n^{\varsigma}$ with $0 < \varsigma < 1$. Therefore, Condition (C1)
holds.

By Lemma 6.1, $E(N_n^{(\tau_1)}(E) - N_n^{(\tau_2)}(E))^3 \le K(\tau_1 - \tau_2)$
for $\phi \ge \tau_1 > \tau_2 > 0$. There exists a constant $C$ satisfying
(C2.b) for any $\theta_d > 9/(1-\mu)$, where $0 < \mu < 1/2$, because $(X_n)$
is a geometrically strong-mixing sequence. Moreover, if
$r_n = n^{\varsigma}$ with $0 < \varsigma < 1$ and $l_n = n^{\gamma}$ with
$0 < \gamma < 2\varsigma/3$, then (C2.c) is satisfied. Therefore, Condition
(C2) holds.

Under the assumptions on {Ai,Bi), F is absolutely continuous and 
rnF{ur„{T)) = T. Let us use Lemma 6.2 with g„ = [n"J , = [ra^J, Sn = 
[n'^l , Xn = [n^l and r„ = [n*^] with < P < a <l, 0<7< 6 >0 and 
< < 1, then there exists a constant K such that 



knY.\p{N^:HE)=i)-p(^\i)\ 



1=1 



locally uniformly for r > 0, with % = (a — /5) A (1 — a) Aa Aj A5{k— e) Ap/n, 
<r] <1, <(p<l and < e < k. Finally, choose 1/(1 -h 2x) < C < 1 such 
that Condition (C3) holds. 
4. Asymptotic properties of the estimators. To characterize the asymptotic
properties of the estimators, it is convenient to introduce
$D^m_{\sigma,\phi} = D([\sigma,\phi], \mathbb{R}^m)$ [resp.
$D^m = D((0,\infty), \mathbb{R}^m)$], the space of functions from
$[\sigma,\phi]$ [resp. $(0,\infty)$] to $\mathbb{R}^m$ which are càglàd
(left-continuous with right-limits) equipped with the strong $J_1$-topology
(see [44] where the spaces of càdlàg functions (right-continuous with
left-limits) are equivalently considered). Let us recall that weak convergence
(which will be denoted by $\Rightarrow$) in $D^m$ is equivalent to weak
convergence of the restrictions of the stochastic processes to any compact
$[\sigma,\phi]$, $0 < \sigma < \phi < \infty$.

We start this section by giving a "weak consistency" result. 

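Proposition 4.1. Suppose that (C0) holds. Let $(r_n)$ be a sequence such
that $r_n \to \infty$ and $r_n = o(n)$, and $0 < \sigma < \phi < \infty$. Then

$(\hat p_n^{(\cdot)}(0), \ldots, \hat p_n^{(\cdot)}(m)) \xrightarrow{P} (p^{(\cdot)}(0), \ldots, p^{(\cdot)}(m))$  in $D^{m+1}$,

$(\hat\pi_n^{(\cdot)}(1), \ldots, \hat\pi_n^{(\cdot)}(m)) \xrightarrow{P} (\pi(1), \ldots, \pi(m))$  in $D^{m}$,

$\bar\pi_n(m) \xrightarrow{P} \pi(m)$, $m \ge 1$, and $\bar\theta_{1,n} \xrightarrow{P} \theta$.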

We continue with a series of results leading to a characterization of the 
distributional asymptotics of the estimators of the limiting cluster size prob- 
abilities. We first introduce the following centered processes: 

ej,n{-) = Vk^ip'hhj) - P(iviii =i))' j ^ 0' 

where 

OO -I kfi -I Tjikfi. 

i=l j=l i=l 

pii^ is called the tail empirical distribution and e„(-) the tail empirical pro- 
cess. They are very useful tools for studying the asymptotic properties of tail 
index estimators (see, e.g., [9, 34]) or for inference of multivariate extreme 
value distributions [18]. The weak convergence of the tail empirical process of 
strong-mixing (resp. absolute regular) stationary sequences has been studied 



in D 



m+l 



in Di 
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by [37] (resp. by [37], [10] and [11]). Note that the absolute regularity con- 
dition implies the strong-mixing condition which implies A({un{Tj)}i<j<r) 
for every choice of ri > • • • > r,. > 0, r > 1. The following theorem deals with 
the weak convergence of the process 

Em,n{') — (co,n (■) ' • • • ) ^m,n(') i ^n- (') ) 

in D"^'^'^. It will be useful throughout this section. 

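Theorem 4.1. Suppose that (C1) and (C2) hold. There exists a pathwise
continuous centered Gaussian process

$E_m(\cdot) = (e_0(\cdot), \ldots, e_m(\cdot), e(\cdot))$

with covariance functions defined for $0 < \tau_2 \le \tau_1$ by:

• if $i = 0, \ldots, m$,

$\mathrm{cov}(e_i(\tau_1), e_i(\tau_2)) = p_2^{(\tau_1,\tau_2)}(i,i) - p^{(\tau_1)}(i)\,p^{(\tau_2)}(i)$,

$\mathrm{cov}(e_i(\tau_1), e(\tau_2)) = \sum_{j=0}^{i} j\,p_2^{(\tau_1,\tau_2)}(i,j) - \tau_2\,p^{(\tau_1)}(i)$,

$\mathrm{cov}(e(\tau_1), e_i(\tau_2)) = \sum_{j=i}^{\infty} j\,p_2^{(\tau_1,\tau_2)}(j,i) - \tau_1\,p^{(\tau_2)}(i)$,

• if $0 \le i < j \le m$,

$\mathrm{cov}(e_i(\tau_1), e_j(\tau_2)) = -p^{(\tau_1)}(i)\,p^{(\tau_2)}(j)$,

• if $0 \le j < i \le m$,

$\mathrm{cov}(e_i(\tau_1), e_j(\tau_2)) = p_2^{(\tau_1,\tau_2)}(i,j) - p^{(\tau_1)}(i)\,p^{(\tau_2)}(j)$,

• and

$\mathrm{cov}(e(\tau_1), e(\tau_2)) = -\ln(p^{(\tau_1)}(0))\sum_{0\le j\le i,\ 1\le i} ij\,\pi_2^{(\tau_1,\tau_2)}(i,j)$,

such that $E_{m,n} \Rightarrow E_m$ in $D^{m+2}$.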

Let us compare the conditions in [37] that are needed for convergence
of $\bar e_n(\cdot)$ in the case of strong-mixing sequences with our
conditions. First we have to impose that the threshold $u_n$ in [37] is such
that $u_n = O(u_{r_n}(\tau))$. Then Condition C1 in [37] is equivalent to our
condition (C2.a). Condition D2 in [37] is slightly weaker than our condition
(C2.b) and condition (C2.c) since we also assume that $l_n = o(r_n^{2/r})$.
Condition C3 in [37] is implied by our Condition (C1), but it appears as a
natural sufficient condition when $u_n = O(u_{r_n}(\tau))$.

Now let us consider the estimators of the compound probabilities and
introduce the following processes:

$\hat e_{j,n}(\cdot) = \sqrt{k_n}\,\bigl(\hat p_n^{(\cdot)}(j) - p^{(\cdot)}(j)\bigr)$,  $j \ge 0$.



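Theorem 4.2. Suppose that (C1), (C2) and (C3) hold. Let
$0 < \sigma < \phi < \infty$. Then

$(\hat e_{0,n}(\cdot), \ldots, \hat e_{m,n}(\cdot)) \Rightarrow (\hat e_0(\cdot), \ldots, \hat e_m(\cdot))$

in $D^{m+1}_{\sigma,\phi}$, where

$\hat e_j(\cdot) = e_j(\cdot) - h_j(\cdot)\,e(\cdot)$

and $h_j(\tau) = dp^{(t)}(j)/dt\,|_{t=\tau}$.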

Note that the hj{-) satisfy the recursion 



... Infa)' '(0)) ^ . ,. 

3 h Vln(p()(0)) 

In order to address the asymptotic properties of the estimators of the 
limiting cluster size probabilities, we construct several processes. Following 
[6], we define recursively the processes dj{-) using the intermediate processes 



P 



(ln(p{-)(0))p(-)(0))- 



by do(-) = -eoi-)M Ho) and for j > 1, 



¥■)■■={ 



min|wj(-),-^(ii(-)|, 
max{0, Wj{-)}, 

|o,min|u'j(-),-^ 



max 



if 7r(j) > and ^vr(i) < 1, 

i=l 

j 

if 7r(j) > and ^vr(i) = 1, 

1=1 
j 

if 7r(j) = and ^vr(i) < 1, 
1=1 
j 

if 7r(j) = and ^7r(?) = 1. 



1=1 
Note that the process $d_j(\cdot)$ depends on the support of the limiting
cluster size distribution. It is not in general a Gaussian process because of
the truncations in its construction, except if $\pi(i) > 0$ for
$i = 1, \ldots, j$ and $\sum_{i=1}^{j}\pi(i) < 1$. In the following corollary
we derive the weak convergence of the processes

$\hat d_{j,n}(\cdot) = \sqrt{k_n}\,\bigl(\hat\pi_n^{(\cdot)}(j) - \pi(j)\bigr)$,  $j \ge 1$,

and the asymptotic behavior of

$\bar d_{j,n} = \sqrt{k_n}\,\bigl(\bar\pi_n(j) - \pi(j)\bigr)$,  $j \ge 1$.

Corollary 4.1. Suppose that (C1), (C2) and (C3) hold. Let
$0 < \sigma < \phi < \infty$. Then

$(\hat d_{1,n}(\cdot), \ldots, \hat d_{m,n}(\cdot)) \Rightarrow (d_1(\cdot), \ldots, d_m(\cdot))$

in $D^m_{\sigma,\phi}$, and

$(\bar d_{1,n}, \ldots, \bar d_{m,n}) \xrightarrow{d} \left(\dfrac{1}{\phi-\sigma}\int_\sigma^\phi d_1(\tau)\,d\tau, \ldots, \dfrac{1}{\phi-\sigma}\int_\sigma^\phi d_m(\tau)\,d\tau\right)$.
We end this section by focusing on the estimators of the extremal index. 

Corollary 4.2. Suppose that (C1), (C2) and (C3) hold. Let
$0 < \sigma < \phi < \infty$. Then



in D^^ and 
Note that the asymptotic variance of $\hat\theta_{1,n}^{(\tau)}$ is given by

It can be estimated by using the estimators of the limiting cluster
probabilities $\hat\pi_n^{(\tau)}(j)$ and the estimator of the extremal index
$\hat\theta_{1,n}^{(\tau)}$.

5. Simulation study. A simulation study is conducted to investigate the 
performance of the estimators on large samples and to make a comparison 
with existing estimators. 

(i) Performance on large samples. Data are simulated from three stationary
Markov processes:

• a squared ARCH(1) process: $X_n = (\eta + \lambda X_{n-1})Z_n^2$,
$n \ge 2$, where $Z_n$ are i.i.d. standard Gaussian r.v.s,
$\eta = 2\times 10^{-5}$, $\lambda = 0.5$ and $X_1$ is a r.v. drawn from the
stationary distribution of the chain. The limiting cluster size probabilities
and the extremal index have been computed by simulations in [19]:
$\pi(1) = 0.751$, $\pi(2) = 0.168$, $\pi(3) = 0.055$, $\pi(4) = 0.014$,
$\pi(5) = 0.008$, $\theta = 0.727$.

• a max-AR(1) process: $X_n = \max\{(1-\theta)X_{n-1}, W_n\}$, $n \ge 2$,
where $W_n$ are i.i.d. unit Fréchet r.v.s, $\theta = 0.5$ and
$X_1 = W_1/\theta$. By [33], $\pi(1) = 0.5$, $\pi(2) = 0.25$,
$\pi(3) = 0.125$, $\pi(4) = 0.0625$, $\pi(5) = 0.031$, $\theta = 0.5$.

• an AR(1) process with uniform marginal: $X_n = r^{-1}X_{n-1} + \varepsilon_n$,
$n \ge 2$, where $(\varepsilon_n)$ are i.i.d. r.v.s uniformly distributed on
$\{0, 1/r, \ldots, (r-1)/r\}$, $r = 4$ and $X_1$ is uniformly distributed on
$(0, 1)$. By [33], $\pi(1) = 0.75$, $\pi(2) = 0.1875$, $\pi(3) = 0.0469$,
$\pi(4) = 0.0117$, $\pi(5) = 0.0029$, $\theta = 0.75$.
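The three processes can be simulated along the following lines. This is only a sketch: the function names are hypothetical, NumPy's `Generator` API is assumed, and for the squared ARCH(1) process the draw from the stationary distribution is replaced by an arbitrary start followed by a burn-in period, which is an approximation introduced for this example.

```python
import numpy as np

def simulate_max_ar1(n, theta, rng):
    """Max-AR(1): X_t = max((1 - theta) X_{t-1}, W_t), W_t unit Frechet."""
    w = 1.0 / rng.exponential(size=n)           # 1/E with E ~ Exp(1) is unit Frechet
    x = np.empty(n)
    x[0] = w[0] / theta
    for t in range(1, n):
        x[t] = max((1.0 - theta) * x[t - 1], w[t])
    return x

def simulate_ar1_uniform(n, r, rng):
    """AR(1) with uniform marginal: X_t = X_{t-1} / r + eps_t,
    eps_t uniform on {0, 1/r, ..., (r-1)/r}."""
    eps = rng.integers(0, r, size=n) / r
    x = np.empty(n)
    x[0] = rng.uniform()
    for t in range(1, n):
        x[t] = x[t - 1] / r + eps[t]
    return x

def simulate_squared_arch1(n, eta, lam, rng, burn_in=1000):
    """Squared ARCH(1): X_t = (eta + lam X_{t-1}) Z_t^2, Z_t standard Gaussian.
    The burn-in (an assumption of this sketch) approximates stationarity."""
    z2 = rng.standard_normal(n + burn_in) ** 2
    x = np.empty(n + burn_in)
    x[0] = eta * z2[0]
    for t in range(1, n + burn_in):
        x[t] = (eta + lam * x[t - 1]) * z2[t]
    return x[burn_in:]
```

A call such as `simulate_max_ar1(2000, 0.5, np.random.default_rng(1))` produces one sample path of the length used in the study; repeating it over independent generators gives the replications.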

To compare the performance of the estimators, 500 sequences of length
$n = 2000$ were simulated from the three processes. We have considered the
ratios $\hat\pi_n^{(1)}(j)/\pi(j)$ for $j = 1, \ldots, 5$, and
$\hat\theta_{i,n}^{(1)}(m)/\theta$ for $i = 2, 3$ and $m = 8$. The graphs show
the average over the 500 samples.

In Figures 1 and 2 the means and the root mean squared errors (RMSE) of the
ratios are plotted as a function of $k_n$. The bias of $\hat\pi_n^{(1)}(1)$ is
small and approximately stable with respect to $k_n$ for the three processes.
The biases of $\hat\pi_n^{(1)}(2)$ and $\hat\pi_n^{(1)}(3)$ are small for the
squared ARCH(1) process and the max-AR(1) process but large for the AR(1)
process.

For $j \ge 4$, the biases of the estimators can be relatively large and it
seems very difficult to have good estimates of $\pi(j)$ in the case of a data
set of length 2000. The RMSE of the ratios increase dramatically with $j$
because of the biases. Note also that a minimum of the RMSE with respect to
$k_n$ cannot always be found. An optimal choice of $k_n$ based on the RMSE
criterion will depend on the process and on the limiting cluster size
probabilities.
Fig. 1. Means of the ratios of the cluster size probabilities
$\hat\pi_n^{(1)}(1)/\pi(1)$, $\hat\pi_n^{(1)}(2)/\pi(2)$,
$\hat\pi_n^{(1)}(3)/\pi(3)$, $\hat\pi_n^{(1)}(4)/\pi(4)$,
$\hat\pi_n^{(1)}(5)/\pi(5)$, and means of the ratios of the extremal index
$\hat\theta_{1,n}^{(1)}/\theta$, $\hat\theta_{2,n}^{(1)}(8)/\theta$ and
$\hat\theta_{3,n}^{(1)}(8)/\theta$ as a function of $k_n = 50, \ldots, 250$
for the squared ARCH(1) process, the max-AR(1) process and the AR(1) process
(dotted line). The graphs show the average over 500 samples of length
$n = 2000$.



The bias of O^l^ is lower than those of 02 ni^) ^3ni^) squared 
ARCH(l) process and the max-AR(l) process. But for the AR(1) process, 
the bias of 9^\{m) is the smaUest. and ^2^^(w,) perform in the same way 

in terms of RMSE and better than 9^^^{m). 

Fig. 2. RMSE of the ratios of the cluster size probabilities
$\hat\pi_n^{(1)}(1)/\pi(1)$, $\hat\pi_n^{(1)}(2)/\pi(2)$,
$\hat\pi_n^{(1)}(3)/\pi(3)$, $\hat\pi_n^{(1)}(4)/\pi(4)$,
$\hat\pi_n^{(1)}(5)/\pi(5)$, and RMSE of the ratios of the extremal index
$\hat\theta_{1,n}^{(1)}/\theta$, $\hat\theta_{2,n}^{(1)}(8)/\theta$ and
$\hat\theta_{3,n}^{(1)}(8)/\theta$ as a function of $k_n = 50, \ldots, 250$
for the squared ARCH(1) process, the max-AR(1) process and the AR(1) process
(dotted line). The graphs show the average over 500 samples of length
$n = 2000$.

(ii) Comparison with existing estimators on large samples. Data are
simulated from the squared ARCH(1) process defined above. 500 sequences of
length $n = 2000$ were also used. For the limiting cluster probabilities
comparisons are made between $\pi_n(j) = \hat\pi_n^{(1)}(j)$, $\bar\pi_n(j)$
with $\sigma = 0.7$ and $\phi = 1.3$,
Hsing's estimators 7r„^i(j) with n/sn = and Ferro's estimators vTnO) 
with $N = k_n$ (see [15], equation (4.12)). For the extremal index comparisons
are made between $\theta_n = \hat\theta_{1,n}^{(1)}$, $\bar\theta_{1,n}$ with
$\sigma = 0.7$ and $\phi = 1.3$, Ferro and Segers' estimator $\theta_n(u)$
with $u = X_{n-k_n+1:n}$ (see [16], equation (5)), Hsing's estimator with
$n/s_n = k_n/2$ (see [21], page 137) and the runs estimator $\theta_n(p, u)$
with $p = \lfloor r_n/6\rfloor$, $u = X_{n-\lfloor n/s_n\rfloor:n}$ and
$n/s_n = k_n/2$ (see [43], page 282).

Fig. 3. RMSE of the ratios of the cluster size probabilities
$\pi_n(1)/\pi(1)$, $\pi_n(2)/\pi(2)$, $\pi_n(3)/\pi(3)$, $\pi_n(4)/\pi(4)$,
$\pi_n(5)/\pi(5)$ as a function of $k_n = 50, \ldots, 250$, for
$\pi_n = \hat\pi_n^{(1)}$, $\pi_n = \bar\pi_n$, Ferro's estimators (dotted
line) and Hsing's estimator $\hat\pi_{n,1}$. RMSE of the ratios of the
extremal index $\theta_n/\theta$ as a function of $k_n = 50, \ldots, 250$, for
$\theta_n = \hat\theta_{1,n}^{(1)}$, $\theta_n = \bar\theta_{1,n}$, Ferro and
Segers' estimator (dotted line), Hsing's estimator and the runs estimator
(dashed line).

In Figure 3 the RMSE of the ratios $\pi_n(i)/\pi(i)$ for $i = 1, \ldots, 5$, and $\theta_n/\theta$ are plotted. For the limiting cluster size probabilities and the extremal index, the smoothed versions $\bar\pi_n$ and $\bar\theta_{1,n}$ perform uniformly better than the unsmoothed estimators $\hat\pi^{(1)}_n$ and $\hat\theta^{(1)}_{1,n}$, which perform uniformly better than the other estimators [except for $\pi(2)$, where Hsing's estimator should be preferred to the unsmoothed estimator]. Like Ferro and Segers' estimators, our estimators only require the choice of a sequence, but their performance is more favorable.
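The RMSE-of-ratio criterion used in these comparisons can be illustrated with a small Monte Carlo sketch. Everything in it is an assumption made for illustration: the max-AR(1) recursion $X_i = \max((1-\theta)X_{i-1}, \theta Z_i)$ with unit Fréchet innovations (a standard specification with extremal index $\theta$, not necessarily the one simulated in the paper), a common form of the runs estimator (not necessarily the exact version of [43]), the reading of "RMSE of the ratio" as the root mean squared deviation of $\hat\theta/\theta$ from 1, and all function names.

```python
import math
import random


def simulate_max_ar1(n, theta, rng):
    """Simulate X_i = max((1 - theta) * X_{i-1}, theta * Z_i), Z_i unit Frechet.

    Assumed specification: the stationary marginal is unit Frechet and the
    extremal index equals theta.
    """
    def frechet():
        # Inverse-CDF sampling from P(Z <= z) = exp(-1/z).
        u = rng.random()
        while u == 0.0:
            u = rng.random()
        return -1.0 / math.log(u)

    x = frechet()  # start from the stationary marginal
    path = []
    for _ in range(n):
        x = max((1.0 - theta) * x, theta * frechet())
        path.append(x)
    return path


def runs_estimator(x, u, run_length):
    """Share of exceedances of u followed by run_length non-exceedances."""
    exceed = 0
    cluster_ends = 0
    for i in range(len(x) - run_length):
        if x[i] > u:
            exceed += 1
            if all(x[i + j] <= u for j in range(1, run_length + 1)):
                cluster_ends += 1
    return cluster_ends / exceed if exceed > 0 else float("nan")


def rmse_of_ratio(theta, n, k, run_length, n_samples, seed=0):
    """Root mean squared deviation of theta_hat / theta from 1 over simulated samples."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        x = simulate_max_ar1(n, theta, rng)
        u = sorted(x)[n - k]  # k-th largest order statistic X_{n-k+1:n}
        ratio = runs_estimator(x, u, run_length) / theta
        total += (ratio - 1.0) ** 2
    return math.sqrt(total / n_samples)


if __name__ == "__main__":
    print(rmse_of_ratio(theta=0.5, n=2000, k=100, run_length=5, n_samples=50))
```

Replacing `runs_estimator` by another estimator and looping over `k` would give curves of the kind described for Figure 3, under the same assumptions.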



6. Proofs. Throughout we let $K$ be a generic constant whose value may change from line to line.

Lemma 6.1. Consider the first order stochastic equation with random coefficients of Example 3.1. There exists a constant $K$ such that, for $\phi > \tau_1 > \tau_2 > 0$ and $n \ge 1$,

$$E\bigl(N_n^{(\tau_1)}(E) - N_n^{(\tau_2)}(E)\bigr)^{3} \le K(\tau_1 - \tau_2).$$

Proof. Let $I_n(\tau_1,\tau_2) = (u_n(\tau_1), u_n(\tau_2)]$. By using the same arguments as in the proof of Lemma 4.1 in [10], we can show that there exists a constant $K$ such that, for $\phi > \tau_1 > \tau_2 > 0$ and $n \ge 1$,

= PiXj e IniTl,T2)\Xi E IniTuT2)) < ^(^ + > i > ^ > 1, 

where $\varphi = EA^{\xi} < 1$ for $\xi \in (0, \kappa)$. By stationarity and the Markov property, we get

E{NtHE) - NtHE)f 

<3!n ■^l{Xie-fn{ri,r2)}l{X,e/„(ri,T2)}l{X,+j_iG7„(Ti,T2)} 

<3!(ri-r2) ^ ci^iCiA+j-i 

i,j>l,i+j<n+l^ ^ ^ 

<K{ti-T2). □ 

Lemma 6.2. Consider the first order stochastic equation with random 
coefficients of Example 3.1. Let {q-n), (m-n), {^n), {xn) he sequences of in- 
tegers such that g„ — > oo and qn = o(n), m„ — > oo and m„ = o{qn), oo 
and n5n^ oo and x„ — > oo as n ^ oo. Then for each I > 0, there exists a 
constant K such that 

\P{N^:\E) = - P^^\l)\ <k(^ + ^-^ + - + + ^LlS 

\qn n qn g„ ' n5n'' 

+ 5-^ + u,''-xi + x-^''-'Hn-Pl^\ 
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locally uniformly for t > 0, where < r] < 1, < if < 1, 0<^<k and < 
e < K. 



Proof. Write d{n,l) = \P{Ni^\E) = I) - p^^\l)\. Let 9, 



n X 



P{Ni^\Bn) > 0)/(Tg„), where Bn = (0;^„/n]. Let > 1, bei.i.d. integer- 

valued r.v.s such that 

^(Cff = m) = P{N(^\Bn) = m|iVM(i?„) > o), m> 1, 
and r]{9n\) be a Poisson r.v. with parameter ^l^V and independent of the 

(t) 

Q^. We have that 



4r) 



din, I) < 



p{Ni^HE)=i)-pi^ y: ci^=/j 



+ 



■p 



(r 



= :ll + lll. 

By using Theorem 2 in [30], we deduce that 



li < 2r^ + 2r*i + - min{6a2/^Jr); /3„,^„ (r)}, 



where 



/3„,Kt)= sup Esnv\P{B\:F[]})-P{B):B^P,^i^^,. 

l<t<n-l 



Note that, since $(X_n)$ is a geometrically absolutely regular sequence, there exists a constant $0 < \eta < 1$ such that, for every choice of $\tau > 0$ and $1 \le l < n$,

By (1.3) and (1.4), we deduce that there exist constants $K_{1,l} > 0$ and $K_{2,l} > 0$ such that, locally uniformly for $\tau > 0$,

II; < - 0o| + i^2,« E|vrM(A;) - vr(A;)| 



k=l 



l+l 



k=l 



j=k 



Let e'j^l = P{Ni^\Bn) = k\Xo > u„(t)). Note that 



- 0o\ < \0i^^ - <il + - ^ol =: Ila + IB 
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and for k>l, 

q-n 

j=k 



< 



By using the same arguments as for the proof of Lemma 2.4 in [33], we have for $k \ge 1$

$$\bigl|P(N_n^{(\tau)}(B_n) \ge k) - (q_n - k + 1)P(N_n^{(\tau)}(B_n) = k-1, X_0 > u_n(\tau))\bigr| \le kP(M_{0,q_n} > u_n(\tau), M_{q_n,2q_n} > u_n(\tau)),$$

where $M_{i,j} = \max\{X_l : l = i+1, \ldots, j\}$. It follows that for $k \ge 1$

lie;- < k ^° ^-^(^^P'-?" ^ ^n(^)>^gn.2g„ > Un{T)) ^ k - I Op 



+ 



'k-l,n 



Qn 0, 



and 



Ila < nP(Mo,g„ > n„(T),Mg„,2g„ > ■«n(r))/(Tgn). 
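Now observe that

$$\begin{aligned}
P(M_{0,q_n} > u_n(\tau), M_{q_n,2q_n} > u_n(\tau))
&= P\bigl(\{\{M_{0,q_n-m_n} > u_n(\tau)\} \cup \{M_{q_n-m_n,q_n} > u_n(\tau)\}\} \cap \{M_{q_n,2q_n} > u_n(\tau)\}\bigr) \\
&\le P(M_{q_n-m_n,q_n} > u_n(\tau)) + \alpha_{n,m_n}(\tau) + P^{2}(M_{0,q_n} > u_n(\tau))
\end{aligned}$$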



m 
n 

and, therefore 

i+i 



+ «n,m„(r)+ ( T— 1 ^0 



Ila + (Ilcfc + 114) <k(— + -an,m„ (r) + ^ + -) . 

\ Qn Qn Qn / 

Let cTfc = j:T=k = > ^) = /r ^(HO' > l ^ IU=1 > x-^l > X 

x~'^~^ dx. We have that O^-i = Cfc — for /c > 1. Then 

971 



116 < 



IIefc< 



j=k-l 



9M 



+ 



qn 
j=fc 



k>l. 



Let us define the probability measure $Q_n$ on $(1,\infty)$ by

$$Q_n(dx) = P((u_n(\tau))^{-1}X_0 \in dx)/P((u_n(\tau))^{-1}X_0 > 1).$$

As in [19], we introduce the process $(\Delta_n)$ defined by $\Delta_0 = 0$ and $\Delta_n = A_n\Delta_{n-1} + B_n$, $n \ge 1$. We have that $\Delta_n \ge 0$ and $X_n = X_0\prod_{i=1}^{n} A_i + \Delta_n$ for $n \ge 1$. Let

Bk{qn, {Aj)j=l,...,q„,Qn) 

p(^^^^l<j<qn■.Xol[A^ + AJ> Un(T)| 

> /c|K(T))"^Xo = xj Qn{dx). 
Note that P{Ni^\Bn) > k\Xo > n„(T)) = Bk{qn, (Aj)j=i,...,g„, Q„) and that 

< \Bk{qn, (Aj,)j=i,...,g„,(5n) - Bk{qn, (0)j,=l,. . , Qn) | 



j=fc 

=l,...,oo ) ) I 

= 1,...,00 5 = 1,...,00) 

We now consider successively each term of the upper bound: 
(i) On the one hand, we have that 



P (^ttj 1 < J < 'Zn : ^0 n ^^ + > ^n{r) | > A: 



(u„(r)) ^Xo = x 



X Qnidx) 



> p(^^^^l<j<qn:Xo Jl Ai > n„(r) | > A: 



(u„(t)) ^Xo = x 



X Qn{dx). 
On the other hand, we have that 
( ( i 



1 < J < g„ :Xo n A, + Aj > n„(r)| > /cj 

C jtij 1 < i < : I n > ^n(T)(l -5-')^ 

U{A,>5-1u„(t)}|>A; 



n 

j=i 



C ah < i < : n yl. > u„(r)(l - (^,-1) 
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+ tt{l <j<qn: > d~\n{T)} > k 



C [jmi<j<qn:Aj>d~\n{T)}=l} 



1=0 



n {Hi <j<qn: A,- > S~\n{r)} >{k-l)V 0}. 



Then 



1^°^ p(^^i^l<j<qn:XoJ[Ai + Aj>Un{T)^>k 



{unir)) ^Xo = X 



X Qn{dx) 



< p(^^i^l<j<qn:Xo n > Un{T){l - <5^i) I > 

iUn{T))~^Xo = Qn{dx) 

/oo 
Pm <j<qn: {A, > 6~\n{T)}} > 0)Qn{dx). 

Note that $\Delta_j \le X_j$ for $j \ge 1$ and, therefore,

/oo 
^'(Hl < i < qn : {A, > <5„n„(r)}} > 0)Q„(dx) 



<P(M,„><5-in„(r))<K 



if g„ — > oo and ?i(5„ ^ oo as n ^ oo. Moreover, by a change of variable, we 
have that 



p(^^l^l<j<qn:Xo n Ai > n„(r)(l - 5^^) | > A: 



(ti„(r)) ^Xo = x 



X Qn{dx 
(l + o(l)) 



)"Ai-5n')-' V I i=i J / 



Since the density function of Q„, is uniformly bounded in a neighborhood of 
1, we deduce that J^^ ^" ^ Qn{dx) < K6~^ and it follows that 

\Bk{qn,{^j)j=l,-,qu^Qn) - -Bfe (^n , (0) j=l,...,g„ , Qn) | < K 
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(ii) Let f = EA{ < 1 for ^ G (0, k). We have that 



X Qn{dx) 



< p(^\l<3<qn:\{A,> x^' | > fe^ Qn{dx) 

+ P l^tt I j > g„ : n > a;"' I > 0^ g„ (c?x) . 
It follows that 

\Bk{qn, {0)j=l,...,q„,Qn) " Bk{oO, {O)j=l,...,oo, Qn)\ 

< (^tiji > : n > I > j Q„(dx) 

PXn / OO j \ 

^( V Y[A>X~^\Qn{dx)+Qn{Xn,(Xi) 



< E ^^4 + Qn(rE„,00)<^^ ^ + ^37 

by Chebyshev's inequality and Potter's bounds. 

(iii) Let $f_k(x) = P(\#\{j \ge 1 : \prod_{i=1}^{j} A_i > x^{-1}\} \ge k)$. Since the distribution of $A_1$ is absolutely continuous, $f_k$ is differentiable. Then we have

Bk{00, (O)j=i,,,,,oo, Qn) - Bk{00, (O)j=i,...,oo, Q) 

fk{x){Qn{dx) - Q{dx)) = f'^\x){Qn{x, oo) - Q{x,oo))dx, 

where fl^^ is the first derivative of fk - But, by equation (3.2), sup2.>;^ \ {Qn{x, oo) x 
Q~^{x,oo) — 1)1 < Kn~^/'^ and we deduce that 

|Sfc(oo, (0),=i„„,oo, Qn) - 5fc(oo, (0),=i,...,oo, Q)| < Kn-P'\ 
Putting the inequalities together yields 

116 + V lieu < K + + (^""xl + x"^^-''^ + tTpI'^ 
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and the result follows. □ 

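We now define the big blocks $I_j^{\triangle}$ and the small blocks $I_j^{*}$ by

$$I_j^{\triangle} = \{(j-1)r_n + 1, \ldots, jr_n - l_n\}, \qquad I_j^{*} = \{jr_n - l_n + 1, \ldots, jr_n\}, \qquad j = 1, \ldots, k_n.$$

Let us introduce the following r.v.s associated with the big and small blocks:

$$N^{(\tau),\triangle}_{r_n,j} = \sum_{i \in I_j^{\triangle}} 1_{\{X_i > u_{r_n}(\tau)\}}, \qquad N^{(\tau),*}_{r_n,j} = \sum_{i \in I_j^{*}} 1_{\{X_i > u_{r_n}(\tau)\}}, \qquad j = 1, \ldots, k_n,$$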



"'71. • 1 ^ '"n.J J 



It is easily seen that $p_n^{(\tau)}(i) = p_n^{(\tau),\triangle}(i) + p_n^{(\tau),*}(i)$ and $p_n^{(\tau)} = p_n^{(\tau),\triangle} + p_n^{(\tau),*}$.
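The big-block/small-block decomposition can be checked numerically with a short sketch. It rests on assumed forms of the block statistics, chosen to agree with how they are used in the proofs that follow: the proportion of blocks with exactly $i$ exceedances, its big-block analogue, and a remainder equal to the block average of the difference of the two indicators; likewise the mean exceedance count per block, split into big-block and small-block parts. The function name and the returned dictionary keys are hypothetical.

```python
def block_statistics(x, u, r_n, l_n, i):
    """Block proportions and block means for threshold u.

    Each block of length r_n is split into a big block (first r_n - l_n points)
    and a small block (last l_n points). Assumed definitions, for illustration:
      p      : share of blocks with exactly i exceedances
      p_big  : share of blocks whose big part has exactly i exceedances
      p_rem  : block average of 1{N = i} - 1{N_big = i}
      mean, mean_big, mean_small : average exceedance counts per block
    """
    k_n = len(x) // r_n
    p = p_big = p_rem = 0.0
    mean = mean_big = mean_small = 0.0
    for j in range(k_n):
        block = x[j * r_n:(j + 1) * r_n]
        n_big = sum(1 for v in block[:r_n - l_n] if v > u)
        n_small = sum(1 for v in block[r_n - l_n:] if v > u)
        n_all = n_big + n_small
        p += (n_all == i)
        p_big += (n_big == i)
        p_rem += (n_all == i) - (n_big == i)
        mean += n_all
        mean_big += n_big
        mean_small += n_small
    return {
        "p": p / k_n, "p_big": p_big / k_n, "p_rem": p_rem / k_n,
        "mean": mean / k_n, "mean_big": mean_big / k_n,
        "mean_small": mean_small / k_n,
    }


if __name__ == "__main__":
    data = [0, 2, 0, 0, 2, 2, 0, 0, 0, 0, 0, 2, 2, 0, 0, 0, 0, 0]
    print(block_statistics(data, u=1, r_n=6, l_n=2, i=1))
```

The remainder terms are only nonzero for blocks whose small part contains an exceedance, which is why they vanish when $l_n/r_n \to 0$.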

To prove Proposition 4.1, we will need the three following lemmas. The 
first lemma can be derived from Lemma 1 in [7]. 

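Lemma 6.3. Let $p_1$, $p_2$, $p_3$ be positive numbers such that $p_1^{-1} + p_2^{-1} + p_3^{-1} = 1$. Suppose that $Y$ and $Z$ are random variables measurable with respect to the $\sigma$-algebras $\mathcal{F}_{1,m}^{(\tau_1,\ldots,\tau_r)}$ and $\mathcal{F}_{m+l,n}^{(\tau_1,\ldots,\tau_r)}$, respectively ($1 \le m \le n - l$), and assume further that $\|Y\|_{p_1} = (E|Y|^{p_1})^{1/p_1} < \infty$, $\|Z\|_{p_2} = (E|Z|^{p_2})^{1/p_2} < \infty$. Then

$$|\mathrm{Cov}(Y, Z)| \le 10\,(\alpha_{n,l}(\tau_1, \ldots, \tau_r))^{1/p_3}\,\|Y\|_{p_1}\|Z\|_{p_2}.$$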

Lemma 6.4. Suppose that (C0) holds. Let $(r_n)$ be a sequence such that $r_n \to \infty$ and $r_n = o(n)$. Then $p_n^{(\tau)}(i) \xrightarrow{P} p^{(\tau)}(i)$ and $p_n^{(\tau)} \xrightarrow{P} \tau$.

Proof. Since $r_n \to \infty$, $\Delta(u_{r_n}(\tau))$ holds and there exists a sequence $(l_n)$ such that $l_n \to \infty$, $l_n = o(r_n)$ and $\alpha_{r_n,l_n}(\tau) \to 0$. Let $\varepsilon > 0$. By Chebyshev's inequality,

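$$\begin{aligned}
P(|p_n^{(\tau),*}(i)| > \varepsilon)
&\le \varepsilon^{-1}E|p_n^{(\tau),*}(i)| \\
&\le \varepsilon^{-1}\bigl(P(N^{(\tau),\triangle}_{r_n,1} = i, N^{(\tau),*}_{r_n,1} > 0) + P(N^{(\tau)}_{r_n,1} = i, N^{(\tau),*}_{r_n,1} > 0)\bigr) \\
&\le 2\varepsilon^{-1}P(N^{(\tau),*}_{r_n,1} > 0) \le 2\varepsilon^{-1}P\Bigl(\bigcup_{i \in I_1^{*}}\{X_i > u_{r_n}(\tau)\}\Bigr) \le 2\varepsilon^{-1}\tau l_n/r_n \to 0
\end{aligned}$$

and

$$P(p_n^{(\tau),*} > \varepsilon) \le \varepsilon^{-1}E(N^{(\tau),*}_{r_n,1}) \le \varepsilon^{-1}\tau l_n/r_n \to 0.$$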

Hence, Pn^'*{i) and pn^'* —> 0. Now let us show that Pn^'^{i) ^ p^'^\i) 
and pn'''^ r. Since Iim„^oo -P(^'^^i*i* = i) = and lim^^oo -E(ivJ^|{*) = 
0, we deduce by condition (CO.b) that lim„_»(-5o 

P{Nl'^^f =i)=p^^\i) and 
lim„_»oo ~ Therefore, it suffices to show that 

We have 

p(ip(:)'^(.)-p(iv5^=i)i>e) 

<e''E{p^^^^(^)-P{Nl^Jf = ^)f 

<2{Ke)-^ ^ |Cov(l 1 M,A )|. 

By using Lemma 6.3 with $p_1 = \infty$, $p_2 = \infty$, $p_3 = 1$, we get
P(|pM'^(.)-P(iv5^ = i)|>e) 

< K{kne)-^\^n + 51 (^"n -i)«r„,/„+(i-l)r„(^)j II l{7vM-^=i} H ^ 

<Jfe-2(fc-i (r))^0. 

In the same way, by using Lemma 6.3 with $p_1 = p$, $p_2 = p$, $p_3 = p/(p-2)$, we get

Pi\p(^^^^-EiN^f)\>e) 

<e~'E{p^^)^^-E{Ni2{'')f<2{kner' E |Cov(<f,<f)| 
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Observe that sup„>i £;(iV,^^^i)/' < oo by condition (CO.b) and 1 - 2p-'^ > 
to conclude. □ 

Lemma 6.5. Suppose that (C0) holds. Let $(r_n)$ be a sequence such that $r_n \to \infty$ and $r_n = o(n)$. Then

$$(p_n^{(\cdot)}(0), \ldots, p_n^{(\cdot)}(m), p_n^{(\cdot)}) \Rightarrow (p^{(\cdot)}(0), \ldots, p^{(\cdot)}(m), (\cdot)) \quad \text{in } D^{m+2}.$$

Proof. Let us first recall that convergence in $D^{m+2}$ is equivalent to convergence in $D^{m+2}_{\sigma,\phi}$ for all choices of positive $\sigma$ and $\phi$, $0 < \sigma < \phi < \infty$. Moreover, since $(p^{(\cdot)}(0), \ldots, p^{(\cdot)}(m), (\cdot))$ is a deterministic element of $D^{m+2}$, we only need to prove that $p_n^{(\cdot)}(i) \Rightarrow p^{(\cdot)}(i)$ in $D_{\sigma,\phi}$, $i = 0, \ldots, m$, and $p_n^{(\cdot)} \Rightarrow (\cdot)$ in $D_{\sigma,\phi}$. By Theorem 13.1 in [3], it suffices to prove that the finite-dimensional distributions converge and that a tightness criterion holds. It is easily seen that the first condition is satisfied by using Lemma 6.4. We only need to check that the $(p_n^{(\cdot)}(i))_{n \ge 1}$, $i = 0, \ldots, m$, and $(p_n^{(\cdot)})_{n \ge 1}$ are tight in $D_{\sigma,\phi}$. Following Section 12 in [3], we call a set $\{\tau_j\}$ $\delta$-sparse if it satisfies $\sigma = \tau_0 < \cdots < \tau_w = \phi$ and $\min_{1 \le j \le w}(\tau_j - \tau_{j-1}) > \delta$, and we define for $q \in D_{\sigma,\phi}$

$$w'(q,\delta) = \inf_{\{\tau_j\}} \max_{1 \le j \le w} \sup_{s,t \in (\tau_{j-1},\tau_j]} |q(s) - q(t)|.$$

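By using Theorem 13.2 in [3] and its corollary, $p_n^{(\cdot)}(i)$ is tight in $D_{\sigma,\phi}$ if and only if the two following conditions hold:

(i) for each $\tau$ in a set that is dense in $[\sigma, \phi]$ and contains $\sigma$,

$$\lim_{a \to \infty} \limsup_{n} P(p_n^{(\tau)}(i) > a) = 0,$$

(ii) for each $\varepsilon > 0$, $\lim_{\delta \to 0} \limsup_{n} P(w'(p_n^{(\cdot)}(i), \delta) > \varepsilon) = 0$.

Condition (i) is satisfied since $p_n^{(\tau)}(i) \xrightarrow{P} p^{(\tau)}(i) \le 1$ for each $\tau \in [\sigma, \phi]$ (by Lemma 6.4). Let us now consider condition (ii). Let $\delta < \phi - \sigma$ and define $M_\delta = \lfloor(\phi - \sigma)\delta^{-1}\rfloor + 1$, $\tau_l^{\delta} = \sigma + l\delta$ for $0 \le l < M_\delta$ and $\tau_{M_\delta}^{\delta} = \phi$. Note that $\tau \mapsto \sum_{j=0}^{i} p_n^{(\tau)}(j)$ is a nonincreasing function, and then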



sup 



j=0 j=0 
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£ 



It follows that 

\j=0 I ^j=0 

If I > 1, we have 

P(u;'(pl:)(i),5)>e) 

<p(^^max^X(P«^"'^(^VPn'^(j)) > 

If i = 0, we have 

P{w'{p'^\i),6) > e) < P^nax^(p^-^\0) -pi^' V)) > ^) • 
By using Lemma 6.4, we get 

max max j^ip^^'-^Kj) - p^^'Kj)), 

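which is less than $\varepsilon/2$ for small $\delta$, since $\tau \mapsto \sum_{j=0}^{i} p^{(\tau)}(j)$ is a continuous and bounded function on $[\sigma, \phi]$. Thus, we deduce that

$$\lim_{\delta \to 0} \limsup_{n} P(w'(p_n^{(\cdot)}(i), \delta) > \varepsilon) = 0.$$

Condition (ii) is satisfied and $p_n^{(\cdot)}(i)$ is tight in $D_{\sigma,\phi}$.

Now note that $\tau \mapsto p_n^{(\tau)}$ is a nondecreasing function and $dp^{(\tau)}/d\tau = 1$. The arguments for $p_n^{(\cdot)}$ run similarly. We conclude that $(p_n^{(\cdot)}(0), \ldots, p_n^{(\cdot)}(m), p_n^{(\cdot)})$ weakly converges in $D^{m+2}_{\sigma,\phi}$, and then in $D^{m+2}$. □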

Proof of Proposition 4.1. The generalized inverse of pn"* is given by 

r > 0: ^ l{X.>F-{l-r/r„)} > knf\ =rnF{Xk„r„_ik„f\:kr.rJ 

since F^(F(Xi. „ ^i.i, „ )) = Xu „ =1 . i. . It is a caglad function 
on [c7, Note that for f G [o", and such that [A;,„rJ < A:„r„, 

j5^'^^(r77) =p^P" ^' )(r7i), m>0. 
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Let ^|,o-,<}!) (resp. D'^'f^, C^,a,<f), be the space of nondecreasing func- 

tions from [cr, (j)] to R (resp. nondecreasing functions from [cr, (j)] to [a, (^], con- 
tinuous nondecreasing functions from [a, (j)] to R, continuous nondecreasing 
functions from [cj, (j)] to [a, (j)] ) . 

Let us introduce the map T from Di^^„^^ to D'^'f^p taking h into max((T, 

min(/i^, (/))). It is continuous at C"'f^. Let us denote by p\^f^ the function 

T(pn^). By Lemma 6.5 and the continuous mapping theorem (CMT), it 

fohows that ^ ^((•)) = (•) ^"{U 

Moreover, the composition map from D'^J^^ x D'^'f^ to D^^^ taking {g, h) 

into g o h is continuous at {g, h) G C™^"^ x C^'f^ (see, e.g., [2], page 145). It 
follows by the CMT that 

ipt''''^\0), . . . Jf'^\m)) (p(-)(0), . . . ,p(-)(m)) 



in DZ^t^. Now we have 



sup \pf'^^\j)-pP^\j)\ 

Te[a,0] 



< sup \p^;:\j)-pP{j)\i,.i.).^^. 

V sup \p'^n\j)-p'i\m<^W,^^^y 

Since the weak limit of {pii {j))n>i is continuous at a and (f), p^^'^ a and 
pit^'^ we deduce that 

sup |p(f^^'''^)0-)-p?-'^\j)|^0, 

tG[(T,(/)] 

or, equivalently, pn (j) — (j) =^ in D^^. Finally, we get 

(p«(0), . . . ,p^\m)) (/)(0), . . . J-\m)) in 

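To prove weak convergence of $(\pi_n^{(\cdot)}(1), \ldots, \pi_n^{(\cdot)}(m))$ in $D^{m}_{\sigma,\phi}$, we proceed by induction. First note that by Lemma 6.5 $\lim_{n \to \infty} P([p_n^{(\phi)}(0), p_n^{(\sigma)}(0)] \subset (0,1)) = 1$. We deduce by the CMT that

$$x_n^{(\cdot)}(1) = -\frac{p_n^{(\cdot)}(1)}{p_n^{(\cdot)}(0)\ln(p_n^{(\cdot)}(0))} \Rightarrow \pi(1), \qquad \pi_n^{(\cdot)}(1) = \max(0, \min(x_n^{(\cdot)}(1), 1)) \Rightarrow \pi(1)$$

in $D_{\sigma,\phi}$ and

$$(p_n^{(\cdot)}(0), p_n^{(\cdot)}(1), \pi_n^{(\cdot)}(1)) \Rightarrow (p^{(\cdot)}(0), p^{(\cdot)}(1), \pi(1))$$

in $D^{3}_{\sigma,\phi}$. Now assume that we have already shown that

$$(p_n^{(\cdot)}(0), \ldots, p_n^{(\cdot)}(j), \pi_n^{(\cdot)}(1), \ldots, \pi_n^{(\cdot)}(j-1)) \Rightarrow (p^{(\cdot)}(0), \ldots, p^{(\cdot)}(j), \pi(1), \ldots, \pi(j-1))$$

in $D^{2j}_{\sigma,\phi}$. Let us define the maps $\Psi_j$ from $D^{2j}_{\sigma,\phi}$ to $D_{\sigma,\phi}$ taking $f(\cdot) = (f_i(\cdot))_{i=1,\ldots,2j}$ into

$$\Psi_j(f)(\cdot) = -\frac{f_{j+1}(\cdot) + j^{-1}\ln(f_1(\cdot))\sum_{i=1}^{j-1} i f_{j+1+i}(\cdot) f_{j-i+1}(\cdot)}{\ln(f_1(\cdot)) f_1(\cdot)}.$$

Note that

$$x_n^{(\cdot)}(j) = \Psi_j(p_n^{(\cdot)}(0), \ldots, p_n^{(\cdot)}(j), \pi_n^{(\cdot)}(1), \ldots, \pi_n^{(\cdot)}(j-1))$$

and that $\Psi_j$ is continuous on the space of continuous functions from $[\sigma, \phi]$ whose first coordinate takes values in $(0,1)$. It follows by the CMT that $x_n^{(\cdot)}(j) \Rightarrow \pi(j)$ in $D_{\sigma,\phi}$. Let us recall that

$$\pi_n^{(\cdot)}(j) = \max\Bigl(0, \min\Bigl(x_n^{(\cdot)}(j), 1 - \sum_{i=1}^{j-1}\pi_n^{(\cdot)}(i)\Bigr)\Bigr).$$

We conclude by the CMT that $\pi_n^{(\cdot)}(j) \Rightarrow \pi(j)$ in $D_{\sigma,\phi}$ and

$$(p_n^{(\cdot)}(0), \ldots, p_n^{(\cdot)}(j+1), \pi_n^{(\cdot)}(1), \ldots, \pi_n^{(\cdot)}(j)) \Rightarrow (p^{(\cdot)}(0), \ldots, p^{(\cdot)}(j+1), \pi(1), \ldots, \pi(j))$$

in $D^{2j+2}_{\sigma,\phi}$. The induction is established and

$$(\pi_n^{(\cdot)}(1), \ldots, \pi_n^{(\cdot)}(m)) \Rightarrow (\pi(1), \ldots, \pi(m))$$

in $D^{m}_{\sigma,\phi}$. Finally, by using again the CMT, we deduce that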
in D'^^. Finally, by using again the CMT, we deduce that 

in D^j^^ 7tn{ra) 7r(m), m > 1, and ^i^n ^ 0. □ 
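The recursive construction used in this proof can be made concrete in the population case, where the exact compound Poisson probabilities $p(0), \ldots, p(m)$ are plugged in instead of their estimates. The sketch below is an illustration under assumptions: the forward step is the classical recursion for compound Poisson probabilities (often called Panjer's recursion, a name not used in the paper), with Poisson rate $\theta\tau = -\ln p(0)$, and the backward step solves the same relation for $\pi(j)$ and then applies the truncation $\max(0, \min(\cdot, 1 - \sum_{i<j}\pi(i)))$. Function names are hypothetical.

```python
import math


def compound_poisson_pmf(rate, pi, j_max):
    """p(0), ..., p(j_max) for a Poisson(rate) number of clusters with sizes ~ pi.

    pi[i - 1] is the probability of cluster size i. Uses the recursion
    p(j) = (rate / j) * sum_{i=1}^{j} i * pi(i) * p(j - i), with p(0) = exp(-rate).
    """
    p = [math.exp(-rate)]
    for j in range(1, j_max + 1):
        s = sum(i * pi[i - 1] * p[j - i] for i in range(1, min(j, len(pi)) + 1))
        p.append(rate * s / j)
    return p


def cluster_probs_from_pmf(p, m):
    """Recover pi(1), ..., pi(m) from p(0), ..., p(m) by inverting the recursion."""
    log_p0 = math.log(p[0])  # equals -rate
    pi = []
    for j in range(1, m + 1):
        s = sum(i * pi[i - 1] * p[j - i] for i in range(1, j))
        x_j = -(p[j] + log_p0 * s / j) / (log_p0 * p[0])
        # Truncation keeps the running vector a sub-probability vector.
        pi.append(max(0.0, min(x_j, 1.0 - sum(pi))))
    return pi


if __name__ == "__main__":
    true_pi = [0.5, 0.3, 0.2]
    pmf = compound_poisson_pmf(rate=0.8, pi=true_pi, j_max=3)
    print(cluster_probs_from_pmf(pmf, 3))
```

With exact inputs the truncation is inactive up to rounding; it matters only when the $p(j)$ are replaced by noisy empirical proportions.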
Let us now define

$$E^{\triangle}_{m,n}(\tau) = (e^{\triangle}_{0,n}(\tau), \ldots, e^{\triangle}_{m,n}(\tau), e^{\triangle}_{n}(\tau)), \qquad E^{*}_{m,n}(\tau) = (e^{*}_{0,n}(\tau), \ldots, e^{*}_{m,n}(\tau), e^{*}_{n}(\tau)).$$

We have $e_{j,n}(\cdot) = e^{\triangle}_{j,n}(\cdot) + e^{*}_{j,n}(\cdot)$ and $e_n(\cdot) = e^{\triangle}_{n}(\cdot) + e^{*}_{n}(\cdot)$. The proof of Theorem 4.1 is now presented in a series of three lemmas.

Lemma 6.6. Suppose that (C2) holds. Let $\tau > 0$. Then $E^{*}_{m,n}(\tau) \xrightarrow{P} 0$.

2 /t 

Proof. By (C2.c), there exists a sequence {In) satisfying /„ = o(rn ) 
and lim„„>oo "^^K^n — 0- We have that 



Eielnir)) 



2 



+ 2K'e{j:{1 - PiNi^f = ^,Nl^;; > 0))' 



= :2(Ii+l2). 

Let $2 < v < r$. By using Lemma 6.3 with $p_1 = v$, $p_2 = v$, $p_3 = v/(v-2)$, we get



kji 1 



{Ar("''.^=i-ArM'.*,ArM'.*>0}'^{Af(");^=i-Af(");*,Ar'");*>0}^ 

l<j<l<k 



< KA;„ M A;„ + ^ {kn - i)(ar„,(j-i)r„ (t)) 



X 


rn,l 




/ kn~l 


<k\ 






V j=i 


<K\ 


/ oo 




\ j=0 
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r \\\2 V ^ T.^ 'n \ 



since Ej^oaj"^" ' < o^- Similarly, I2 < K(/„/r„)2/^ Therefore, 
P{\eUr)\ >e)< e~^E{el^{T)f < K{ljr^f^ - 0. 
By using Lemma 6.3 with $p_1 = v$, $p_2 = v$, $p_3 = v/(v-2)$, we get



xiiiv(:;r-^„,FK„(r))ii^. 

By Theorem 4.1 in [40] [equation (4.4)], we have 

Putting the inequalities above together yields $E^{*}_{m,n}(\tau) \xrightarrow{P} 0$. □

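Lemma 6.7. Suppose that (C1) and (C2) hold. Let $r \ge 1$ and $\tau_1 > \cdots > \tau_r > 0$. Then

$$(E_{m,n}(\tau_1), \ldots, E_{m,n}(\tau_r)) \xrightarrow{d} (E_m(\tau_1), \ldots, E_m(\tau_r)).$$

Proof. Since by Lemma 6.6 $E^{*}_{m,n}(\tau) \xrightarrow{P} 0$, we only prove that

$$(E^{\triangle}_{m,n}(\tau_1), \ldots, E^{\triangle}_{m,n}(\tau_r)) \xrightarrow{d} (E_m(\tau_1), \ldots, E_m(\tau_r)).$$

By applying the Cramér-Wold device, it suffices to prove that, for $\lambda_{h,i} \in \mathbb{R}$, $h = 1, \ldots, r$ and $i = 0, \ldots, m+1$,

$$\sum_{h=1}^{r}\Bigl(\sum_{i=0}^{m}\lambda_{h,i}e^{\triangle}_{i,n}(\tau_h) + \lambda_{h,m+1}e^{\triangle}_{n}(\tau_h)\Bigr) \xrightarrow{d} \sum_{h=1}^{r}\Bigl(\sum_{i=0}^{m}\lambda_{h,i}e_{i}(\tau_h) + \lambda_{h,m+1}e(\tau_h)\Bigr).$$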
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Let 

r m 

I -I ■ ^ I rn , 7 J 



h=li=0 
r 



/i=l 

By using recursively Lemma 6.3 with $p_1 = \infty$, $p_2 = \infty$, $p_3 = 1$, we get



which tends to 0 by condition (C2.c). This implies that the $f_{j,n}$ can be considered as i.i.d. r.v.s. By condition (C0.b) and Minkowski's inequality, $\lim_{n \to \infty} E|f_{j,n}|^{p} < \infty$ where $p > 2$. Therefore,



and Lyapounov's condition holds (see, e.g., [4], page 362). It follows that 
Z]f=i /i,n converges in distribution to a standard Gaussian 

random variable. 

By Condition (CI), (M^'^(^), iV^'^(^)) ^ {'N%^'^ .N^""^) and the limiting 
second central moments of the r.v.s 1, (tu),a , and N^J^)'^ , h= l,...,r, 

=*> '•n,l ' 

exist. Simple calculations yield the covariance functions given in Theorem 
4.1. □ 

Lemma 6.8. Suppose that (C1) and (C2) hold. Then $(E_{m,n}(\cdot))_{n \ge 1}$ is tight in $D^{m+2}_{\sigma,\phi}$.

Proof. We use similar arguments as for the second part of the proof of Theorem 22.1 in [2]. The tightness criterion which is considered is the following (see Theorem 15.5 and Theorem 8.3 in [2]): $(E_{m,n}(\cdot))_{n \ge 1}$ is tight in $D^{m+2}_{\sigma,\phi}$ if:

(i) for each positive $\eta$, there exists an $a$ such that

$$P(|E_{m,n}(\phi)|_1 > a) \le \eta, \qquad n \ge 1,$$

where $|x|_1 = \sum_{j=1}^{m+2}|x_j|$;

(ii) letting $\varepsilon > 0$ and $\eta > 0$, there exists $\delta > 0$ and an integer $n_0$ such that

$$P\Bigl(\sup_{\tau_2 \le \tau_1 \le \tau_2+\delta}|E_{m,n}(\tau_1) - E_{m,n}(\tau_2)|_1 \ge \varepsilon\Bigr) \le \eta\delta, \qquad n \ge n_0,$$



for all T2 G [cT, 






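Moreover, by Theorem 15.5 in [2], it follows that the weak limit of a subsequence $E_{m,n'}(\cdot)$ belongs a.s. to $C^{m+2}_{\sigma,\phi}$.

Condition (i) is satisfied since $E_{m,n}(\phi) \xrightarrow{d} E_m(\phi)$. Let us consider condition (ii). Note that

$$P\Bigl(\sup_{\tau_2 \le \tau_1 \le \tau_2+\delta}|E_{m,n}(\tau_1) - E_{m,n}(\tau_2)|_1 \ge \varepsilon\Bigr)$$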



m 



i=Q 
+ P 



sup |ej,„(ri) - ei,„(T2)| > — - 

T2<Ti<T2+(5 m + i 



sup |e„(Ti) - e„(r2)| > — ^— 

T2<Tl<T2+(5 m + i 



1=0 

+ P 



sup 

T2<Tl<T2+S 



j=0 



> 



2(m + l) 



e 



sup |e„(ri) - e„(r2)| > — - 

T2<Tl<T2+5 m + i 



and it suffices to check the tightness criterion for each $\sum_{j=0}^{i} e_{j,n}(\cdot)$, $i = 0, \ldots, m$, and for $e_n(\cdot)$. Now we simply indicate the modifications to be made in the proof of Theorem 22.1 in [2] to establish that condition (ii) holds.

Let $2 < v < p < r < \infty$ and $\varepsilon > 0$. Assume that $\theta_d > v/(v-2)$ and $\theta_d > (p-1)r/(r-p)$.

(i) Let a <T2 <Ti < (p and define 

Si{Ti,T2]kn) := \/k^(^{ej,n{Ti) - ej,n(7"2))j • 

By Theorem 4.1 in [40] [equation (4.3)], we have that 
E\S,{n,T2;knW 

< K{kpj\p{4:^} < ^ < N^zhf"" + ki+%p{4:^} < ^ < N^:l)f'') 

< K{k^j\E{Nii^i - ^i))"^" + ^^'(^(<"i - 

< K{kll\Ti - T2Y'- + fci+^(ri - r2)P/^). 

Let r/ = p/2 - (1 + e). If < e < 1 and e/kl < (ri - r2)P(V''-i/r-) ^ ^e get 



E\ 



^{ej,n{Tl) - ei,n,(T2)) 
3=0 



<Ke~^(ri-r2)P/" 
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which replaces equation (22.15) of [2]. 



(ii) Let := (<:] - - {EK2> - EN^^')) and define 

S{Ti,T2]kn) ■■= VKienin) - e„(T2)) = ^(,j,n- 

i=i 

By Theorem 4.1 in [40] [equation (4.3)], we have that 

E\S{n,T2; < K(A;P/2||6,„||S + A:Ml6,n||?). 
Now for v>2, 

i6,nr < 2''((ivi::) - + - en^zIt)- 

For large n and for cr < T2 < ti < (j), 

\Ci,n\''<K{iN^2l-N^:'}r + in-r2)). 

By condition (C2.a), we get i?(|^i,n|'^) < K{ti — T2) for 2 < A < r and we 
deduce that 

Therefore, if e < 1 and tjkl < (n - rs)^^^/""!/^) , we have that 

E{\en{n) - en{T2)n < Ke-\Ti - T2rl\ 

which also replaces equation (22.15) of [2]. 

(iii) We replace equation (22.17) in [2] by 



r{T2) 



(n) 



3=0 



< 



^iej,niT2 + 6)- ej,„(r2)) 

j=0 



|en(Tl) - en{T2)\ < |e„(r2 + 6)- e„(T2)| + d^/kr, 

for $\tau_2 \le \tau_1 \le \tau_2 + \delta$, by using monotonicity arguments as in [2].
(iv) We need to replace (22.19) of [2] by 



rv/ {p{r—v)) 



and to assume that 



rjrv 



rv 



p{r — v) {r — v)\2 



1 (1+e) 



> 



Since $\theta_d$ has to be larger than $(p-1)r/(r-p)$, which is increasing in $p$, and $p > v$, we let $p = v(1+\varepsilon)$ and choose $v$ such that

T (l + e)\ 1 



rv 



{r-v) \2 



p 



-(l + e). 
It follows that $v = (3+\varepsilon)r/(r + (1+\varepsilon))$. Then the inequalities $\theta_d > v/(v-2)$ and $\theta_d > (p-1)r/(r-p)$ become

$$\theta_d > \frac{3+\varepsilon}{1+\varepsilon}\,\frac{r}{r-2} \qquad \text{and} \qquad \theta_d > \frac{((2+\varepsilon)^{2} - 2)r - (1+\varepsilon)}{r - (2+\varepsilon)(1+\varepsilon)},$$

which are satisfied if e < ((r — 2) A l/2)/4 and 



r-2(l + 2e) 

Everything else remains the same as for the proof of Theorem 22.1 in [2]. 
Finally, choose /i = 4e. □ 

Proof of Theorem 4.1. Weak convergence in $D^{m+2}$ of a stochastic process is equivalent to weak convergence of the restrictions of the stochastic process to any compact $[\sigma, \phi]$ with $0 < \sigma < \phi < \infty$ in $D^{m+2}_{\sigma,\phi}$. The convergence of the finite-dimensional distributions of $E_{m,n}(\cdot)$ is established by Lemma 6.7 and the tightness of $(E_{m,n}(\cdot))_{n \ge 1}$ in $D^{m+2}_{\sigma,\phi}$ by Lemma 6.8. Weak convergence in $D^{m+2}_{\sigma,\phi}$ follows by Theorem 13.1 in [3]. By Theorem 15.5 in [2], we deduce that □

Proof of Theorem 4.2. Let 

Since sup^g[^^^] \^/k^{P{N^l\ = j) -p^^^j))! ^0 [by Condition (C3)], we 

deduce that (eo,„(-), • ■ • , em,„(-)) =^ (eo(-)) • ■ • > em(-)) in -0™^^ By using the 
function T, the composition map, the same arguments as in the proof of 
Proposition 4.1 and Theorem 4.1, we deduce that 

in D^t^ Now note that 



sup \ej,n{p);''^) - ej^niPn'b )l 
T&[a,(f>] 



< 



sup |ej.n(r) -ej.„(r)|l, (a), 



-<a} 



V sup |ej,n(-r)-ej,„(r)|l (^),^ 



Since the weak limit of (ej,n(-))n>i is continuous at a and (f), pn cr and 

it follows that sup^g[^,^] |ej>(pn^'^) - ^i.n{p^n^h~)\ ^'^'^ t'^^t 

en(-) := V^(p(-) - (•)) = en(-) + V^(rnFK„(-)) " (•))• 
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By Condition (C3), sup^gf^,^] Vftn\i"nF{ur„{T)) — r| ^ 0. It follows by The- 
orem 4.1 that ert(-) =^ e(-) in D^^. Now by using Vervaat's lemma [42], we 
get 

We deduce from the differentiability of p^'\j) and the finite increments for- 
mula that 

in -D^^. Finally, we get 

ej>(-) = {ej,n{pl^^'^) - ej,n{pi]r)) + ej,n{Pn,r) 

+ VKiP^^""~Hj)-P^-Hj)) 

^ej{-)-hj{-)e{-) = ej{-) 

in Dl ,p and 

(eo,n(-),---,em,n(-))^(eo(-),---,em(-)) ^ I?^^'- □ 

Proof of Corollary 4.1. We first recall that a map $T$ between topological vector spaces $B_i$, $i = 1, 2$, is called Hadamard differentiable tangentially to some subset $S \subset B_1$ at $x \in B_1$ if there exists a continuous linear map $T'(x)$ from $B_1$ to $B_2$ such that

$$\frac{T(x + t_n y_n) - T(x)}{t_n} \to T'(x) \cdot y \quad \text{in } B_2$$

for all sequences $t_n \to 0$ in $\mathbb{R}$ and $y_n \in B_1$ converging to $y \in S$. Note that the map $\Psi_j$ introduced in the proof of Proposition 4.1 is Hadamard differentiable tangentially to $C^{2j}_{\sigma,\phi}$ at $f \in C^{2j}_{\sigma,\phi}$ and that



"^^■^^^•^^ • ^^-^ = l(in(/,(.))/i(-))^ - -Air >^^-^ 



1 ^'-^ 



j7i(-) ^ 



Xl(-?'-*)/2j-i+i(-)5'i+i(-) 



1 



We now proceed by induction. By Theorem 4.2, 
in $D^{m+1}_{\sigma,\phi}$. First, we deduce by the $\delta$-method (see Theorem 3.9.4 in [41]) that

VKixkHl) - ^(1)) ^;(/)(0),/)(l)) • (eo(-),ei(-)) = w^{-) 
in D^^. Then 

= max{-y/k^Tr{l),mm{Vk^{xii\l) - 7r(l)), - 7r(l)))) di{-) 

in Dl ,p and 

(eo,n(-)>ei,n(-),c^i,n(-)) ^ {eo{-) , ei{-) , di{-)) 
in ^ . Assume that we have already shown that 

(eo,n(-); • ■ • > ej^n{-)Ji{-), ■ ■ .,dj-i{-)) (eo(-), • ■ • , ej{-),di{-), . . .,dj-i{-)). 
The $\delta$-method yields

VKixkHj) - vr(j)) ^'M^iO), . . . vr( )(l), . . . " 1)) 

X {eo{-),---,ej{-),di{-),...,dj-i{-)) 

in $D_{\sigma,\phi}$, and a straightforward computation shows that the limit is equal to $w_j(\cdot)$. Let us recall that

dj,n{-) =max^-v/A~7r(j), 

min^V^(Xn^(j) -7r(i)),yfc^^l-^7r(i)^ j> 

where i^j^ni') = J2i=i di,n{-)- It follows that dj,n{-) =^ '^j(-) in -D^^^, and 

(eo,n(-)> • • • > ej+i,„(-),di(-), • • • , '^j(-)) (eo(-), • • • , ej+i(-), di(-), • • -Jji-)) 
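in $D^{2j+2}_{\sigma,\phi}$. The induction is established and

$$(d_{1,n}(\cdot), \ldots, d_{m,n}(\cdot)) \Rightarrow (d_1(\cdot), \ldots, d_m(\cdot))$$

in $D^{m}_{\sigma,\phi}$. By the CMT, we deduce that

$$(\bar d_{1,n}, \ldots, \bar d_{m,n}) \xrightarrow{d} \Bigl(\frac{1}{\phi - \sigma}\int_{\sigma}^{\phi} d_1(\tau)\,d\tau, \ldots, \frac{1}{\phi - \sigma}\int_{\sigma}^{\phi} d_m(\tau)\,d\tau\Bigr). \qquad \square$$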

Proof of Corollary 4.2. The assertions follow from the $\delta$-method and the CMT. □
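As a pointwise (fixed $\tau$) illustration of the $\delta$-method argument, the sketch below treats the first step of the recursion, $x(1) = -p(1)/(p(0)\ln p(0))$, as an ordinary function of two real arguments. It computes the gradient in closed form, which can be compared against finite differences, and propagates a covariance matrix through it. This is a scalar analogue under assumptions, not the functional statement proved above; the covariance matrix passed in is a hypothetical input, and the function names are invented for the example.

```python
import math


def psi1(p0, p1):
    """First step of the recursion: x(1) = -p(1) / (p(0) * ln p(0))."""
    return -p1 / (p0 * math.log(p0))


def psi1_gradient(p0, p1):
    """Partial derivatives of psi1 with respect to p0 and p1."""
    d = p0 * math.log(p0)
    return (p1 * (math.log(p0) + 1.0) / d ** 2, -1.0 / d)


def delta_method_variance(p0, p1, cov):
    """Asymptotic variance g' * cov * g of psi1 at (p0, p1).

    cov is a 2 x 2 covariance matrix of the (suitably normalized) estimation
    errors of p(0) and p(1); its value is an assumption supplied by the caller.
    """
    g = psi1_gradient(p0, p1)
    return sum(g[a] * cov[a][b] * g[b] for a in range(2) for b in range(2))


if __name__ == "__main__":
    p0, p1 = 0.5, 0.2
    print(psi1(p0, p1), psi1_gradient(p0, p1))
    print(delta_method_variance(p0, p1, [[1.0, 0.0], [0.0, 1.0]]))
```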
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